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Crystal Structure Of Beta Site App Cleaving Enzvme (Bace\ And Me thods Of Use 

Thereof 

All documents cited in this text, and all documents cited or referenced in docpments cited in 
this text, and any manufacturer's instructions or catalogues for any products cited or 
5 mentioned in this text or in any document hereby incorporated into this text, are hereby 

incorporated herein by reference. Documents incorporated by reference into this text or any 
teachings therein may be used in the practice of this invention. Documents incorporated by 
reference into this text are not admitted to be prior art, Fxuthermore, authors or inventors on 
documents incorporated by reference into this text are not to be considered to be "another" 
10 or "others" as to the present inventive entity and vice versa, especially where one or more 
authors or inventors on documents incorporated by reference into this text are an inventor or 
inventors named in the present inventive entity. 

Field of the Invention 

The present invention relates to the mutant BACE proteins, recombinant BACE proteins, 
15 processes for crystallizing BACE and in particular to its crystal structure and to the uses of 
this structure in drug discovery. 

Background to the Invention 

AlTheimer's disease 

Alzheimer's disease (AD) is estimated to afQict more than 20 million people worldwide and 
20 is believed to be the most common form of dementia. Alzheimer's disease is a progressive 
dementia in which massive deposits of aggregated protein breakdown products - amyloid 
plaques and neurofibrillary tangles accumulate in the brain. The amyloid plaques are 
thought to be responsible for the mental decline seen in Alzheimer's patients. 

Ap or amyloid-p-protein is the major constituent of the plaques which are characteristic of 
25 Alzheimer's disease (De Strooper et al, 1999). Ap is a 39-42 residue peptide formed by the 
specific cleavage of a class I transmembrane protein called APP, or amyloid precursor 
protein. A P-secretase activity cleaves this protein between residues Met671 and Asp672 
(nxunbering of 770aa isofonn of APP) to form the N-terminus of Ap. A second cleavage of 
the peptide is associated with p-secretase to form the C-terminus of the AP peptide. 
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Beta Site APP Cleaving Enzyme (BACE) and Alzheimer's Disease 

Several groups have identified and isolated aspartate proteases that have p-secretase activity 
(Hussain et al,, 1999; Lin et al, 2000; Yan et. al, 1999; Sinha et. al., 1999 and Vassar et al., 

1999) . p-secretase is also known in the literature as Asp2 (Yan et. al, 1999), Beta site APP 
5 Cleaving Enzyme (BACE or BACEl) (Vassar et al., 1999) or memapsin-2 (Lin et al., 

2000) . BACE was identified using a number of experimental approaches such as EST 
database analysis (Hussain et al. 1999); expression cloning (Vassar et al. 1999); 
identification of human homologs from public databases of predicted C elegans proteins 
(Yan et al. 1 999) and finally utilizing an inhibitor to purify the protein from human brain 

10 (Sinha et al. 1999). Thus, five groups employing three different experimental approaches 
led to the identification of the same enzyme, making a strong case that BACE is a P- 
secretase. Mention is also made of the patent literature: WO96/40885, EP871720, U.S. 
Patents Nos. 5,942,400 and 5,744,346, EP855444, US 6,319,689, W099/64587, 
W099/31236, EP1037977, WOOO/17369, WOOl/23533, WO0047618, WOOO/58479, 

15 WOOO/69262, WOOl/00663, WO01/00665, US 6,313,268. 

BACE is a membrane bound type 1 protein that is synthesized as a partially active 
proenzyme, and is abmdantly expressed in brain tissue. It is thought to represent the major 
P-secretase activity, and is considered to be the rate-limiting step in the production of Ap. It 
is thus of special interest in the pathology of Alzheimer's disease, and in the development of 
20 drugs as a treatment for Alzheimer's disease. 

BACE was foimd to be a pepsin-like aspartyl proteinase, the mature enzyme consisting of 
the N-terminal catalytic domain, a transmembrane domain, and a small cytoplasmic domain. 
BACE has an optimum activity at pH 4.0-5.0 (Vassar et al, 1999) and is inhibited weakly by 
standard pepsin inhibitors such as pepstatin. It has been shown that the catalytic domain 
25 minus the transmembrane and cytoplasmic domain has activity against substrate peptides 
(Lin et al, 2000). Consequently, this soluble catalytic domain is suitable for crystallization 
studies and a crystal structure of this will give a representative structure of the BACE active 
site for the design of inhibitor molecules. 

The likelihood of developing Alzheimer's disease increases with age, and as the aging 
30 population of the developed world increases, this disease becomes a greater and greater 

problem. In addition to this, there is a famihal link to Alzheimer's disease and consequentiy 
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any individuals possessing the double mutation of APP known as the Swedish mutation (in 
which the mutated APP forms a considerably improved substrate for B ACE) have a much 
greater chance of developing AD, and also of developing it at an early age {see also US 
6,245,964 and US 5,877,399 pertaining to transgenic rodents comprising APP-Swedish). 
5 Consequently there is a strong case for developing a compound that can be used in a 
prophylactic fashion for these individuals. 

Hence, drugs that reduce or block BACE activity would reduce Ap levels and levels of 
fragments of Ap in the brain or elsewhere where Ap or fragments thereof deposit and thus 
slow the formation of amyloid plaques and the progression of AD or other maladies 
10 involving deposition of Ap or fragments thereof (Y ankner, 1996; De Strooper and Konig, 
1999). BACE is therefore an important candidate for the development of drugs as a 
treatment against Alzheimer's disease and/or against such other maladies. 

The therapeutic potential of inhibiting the deposition of Ap has motivated many groups to 
isolate and characterize secretase enzymes and to identify their potential inhibitors (see, 
15 e.g., WOOl/23533 A2, EP0855444, WOOO/17369, WOOO/58479, WOOO/47618, 

WOOO/77030, WOOl/00665, WOOl/00663, WOOl/29563, WO02/25276, US5,942,400, 
US6,245,884, US6,221,667, US6,21 1,235, WO02/02505, WO02/02506, WO02/02512, 
WO02/02518, WO02/02520, WO02/14264). 

The gene encoding APP is foimd on chromosome 21, which is also the chromosome found 
20 as an extra copy in Downs syndrome. Downs syndrome patients tend to acquire Alzheimers 
disease at an early age, with almost all those over 40 years of age showing Alzheimers-type 
pathology (Oyama et al., 1994). This is thought to be due to the extra copy of the APP gene 
found in these patients, which leads to overexpression of APP and therefore to increased 
levels of APPp causing the high prevalence of Alzheimers disease seen in this population. 
25 Thus inhibitors of BACE could be usefiil in reducing Alzheimers-type pathology in Down's 
syndrome patients. 

It would therefore be useftil to inhibit the deposition of Ap and portions thereof by 
inhibiting BACE through inhibitors designed from the BACE structure as provided herein. 
The determination of the three-dimensional structure of BACE provides a basis for the 
30 design of new and specific ligands for BACE. For example, knowing the three-dimensional 
structure of BACE, computer modelling programs may be used to design different 
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molecules expected to interact with possible or confirmed binding cavities or other 
structural or functional features of BACE or structure-based design approaches may used 
such as those described in Blxmdell et al (Nature Reviews, Drug Discovery, Vol 1, pg 45- 
54, 2002). 

5 Ideally it would be desirable to have an abimdant supply of this enzyme in homogenous 
form. It would also be preferable to solve the structure of a form of BACE with an 
unoccupied active site. This could be used to soak in small molecule inhibitors of the 
en2yme and to investigate their binding modes. We describe here the high yielding 
production of BACE firom bacterial cells in homogenous form, and the generation of protein 
10 suitable for crystallisation and structure determination of BACE in Apo form 

Protein Crystallisation 

It is well known in the art of protein chemistry that crystallising a protein is an xmcertain 
and difficult process without any clear expectation of success. It is now evident that protein 
crystallization is the main hurdle in protein structure determination. For this reason, protein 
15 crystallization has become a research subject in and of itself, and is not simply an extension 
of the protein crystallographer's laboratory. There are many references, which describe the 
difficulties associated with growing protein crystals (Kierzek AM. and Zielenkiewicz P. 
(2001) Biophysical Chemistry 91 1-20 Models of protein crystal growth; Wiencek JM 
(1999) Annu Rev Biomed Eng 1 505-534 New Strategies for crystal growth). 

20 The reasons why it is commonly held that crystallization of protein molecules from solution 
is the major obstacle in the process of determining protein structures are many; proteins are 
complex molecides, and the delicate balance involving specific and non-specific 
interactions with other protein molecules and small molecules in solution, is difficult to 
predict. 

25 Each protein crystallizes vinder a xmique set of conditions, which cannot be predicted in 
advance. Simply supersaturating the protein to bring it out of solution will not work, the 
result would, in most cases, be an amorphous precipitate. Many precipitating agents are 
xised, common ones are different salts, and polyethylene glycols, but others are known. In 
addition, additives such as metals and detergents can be added to modulate the behaviour of 

30 the protein in solution. Many kits are available (e.g., from Hampton Research), which 

attempt to cover as many parameters in crystallization space as possible, but in many cases 
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these are just a starting point to optimize crystalline precipitates and crystals which are 
unsuitable for diffiraction analysis. Successful crystallization is aided by knowledge of the 
proteins behaviour in terms of solubility, dependence on metal ions for correct folding or 
activity, interactions with other molecules and any other information that is available. Even 
5 so, crystallization of proteins is often regarded as a time-consuming process, whereby 
subsequent experiments build on observations of past trials. 

In cases where protein crystals are obtained, these are not necessarily always suitable for 
diffraction analysis; they may be limited in resolution, and it may subsequently be difficult 
to improve them to the point at which they will dif&act to the resolution required for 

10 analysis. Limited resolution in a crystal can be due to several things. It may be due to 
intrinsic mobility of the protein within the crystal; this can be difficult to overcome, even 
with other crystal forms. It may be due to high solvent content within the crystal, which 
consequently results in weak scattering, Altematively, it could be due to defects within the 
crystal lattice, which means that the diffracted x-rays will not be completely in phase jfrom 

15 imit to unit within the lattice. Any one of these or a combination of these could mean that 
the crystals are not suitable for structure determination. 

Some proteins never crystallize, and after a reasonable attempt it is necessary to examine 
the protein itself and consider whether it is possible to make individual domains, different N 
or C-terminal truncations, or point mutations. It is often hard to predict how a protein could 
20 be re-engineered in such a manner as to improve crystallisability. Sometimes the inclusion 
of a ligand in the crystallisation mixture is essential for the production suitable crystals. Our 
understanding of crystallisation mechanisms is still incomplete and the factors of protein 
structure, which are involved in crystallisation, are not well known. 

BACE Production for Crystallisation 

25 Beta secretase (BACE) is an integral membrane protein containing a signal sequence, a pro- 
peptide, a catalytic aspartyl protease domain, a transmembrane region and a C-terminal 
cytoplasmic region. During transit through the endoplasmic reticulum, Golgi apparatus and 
trans Golgi network the pro-peptide is cleaved by a furin-like protease (Bennett et al 2000, 
Creemers et al 2001) and N-glycosylation is added and matured (Haniu et al 2000). The 

30 protein contains 4 potential N-linked glycosylation sites, all of which are used (Bennett et 
al, 2000). 
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Certain active recombinant BACEs - different from those of the herein invention - have 
been produced using heterologous expression systems for mammalian cells (Vassar et al, 
1999, Hussain et al, 1999), insect cells (Mallender et al, 2001) and bacterial cells (Lin et al 
2000). Preferred constructs for crystallisation would be soluble and lack glycosylation: the 
5 former can be achieved by C-terminal truncation of the protein to remove the 

transmembrane and cytoplasmic regions; while glycosylation could be removed either by 
use of a deglycosylating agent such as PNGase F, by expression of the protein in bacteria or 
by mutation of the glycosylation sites. 

The protein used for BACE crystallisation by Hong et al (2000) was produced in bacteria 
10 and was truncated at the C-terminus. Their protein was produced as insoluble inclusion 
bodies and required refolding to give soluble, active protein. Refolding of BACE is made 
more complex by the presence of 3 disulphide bonds in the native protease domain, which 
require careful control of redox conditions to form during in-vitro refolding. The protein 
produced by Hong et al was a mixture of products and was crystallised with inhibitor bound 
15 (see WO 01/00663, WO 01/00665, and US 6,545,127). 

Mention is also made of WO 02/25276, which describes the crystallisation of BACE 
produced in mammalian cells. The protein produced also was a mixture of protein species 
and was also crystallized with an inhibitor bound. 

Mention is also made of WO03/0 12089, which describes the crystallisation of BACE 
20 produced from insect cells. The co-ordinates of BACE with an inhibitor bound are 
provided. 

Siimmarv of the Invention 

In general aspects, the present invention is concerned with the provision of a new, high 
resolution, apo, crystal form of BACE and the use of this structure in identifying or 

25 obtaining agent compounds (especially inhibitors of BACE) for modulating BACE activity, 
and in preferred embodiments identifying or obtaining actual agent compoimds/inhibitors. 
Crystal structure information presented herein is useful in designing potential inhibitors and 
modelling them or their potential interaction with the BACE binding cavity. Potential 
inhibitors may be brought into contact with BACE to test for ability to interact with the 

30 BACE binding cavity. Actual inhibitors may be identified from among potential inhibitors 
synthesized following design and model work performed in silico. An inhibitor identified 
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using the present invention may be formulated into a composition, for instance a 
composition comprising a pharmaceutically acceptable excipient, and may be used in the 
manufacture of a medicament for use in a method of treatment. 

Thus, according to a jBrst aspect of the present invention there is provided a mutant BACE 
5 protein, which protein lacks one or more proteolytic cleavage sites recognized by clostripain 
(or another protease which recognizes the same cleavage site as clostripain). In particular, 
the protein is a BACE protein, which comprises the sequence set out in residues 45 to 455 
of SEQ ID NO:2 (43 to 453 SwissProt P56817), or a fragment thereof comprising residues 
coixesponding to 58 to 398 of SEQ ID NO:2, modified by the following changes: (a) 
10 substitution or deletion of at least one residue which is a proteolytic cleavage site 

recognised by clostripain; and (b) optionally the replacement of from 1 to 30 other amino 
acids by an equivalent or fewer number of amino acids. It will be imderstood that when the 
BACE protein comprises a fragment as defined above, the fragment will comprise at least 
feature (a) and optionally feature (b). 

1 5 The modification is such that the BACE protein preferably retains at least one proteolytic 
cleavage site recognised by clostripain so that it may be cleaved to provide homogeneous 
location at which cleavage occurs. 

According to a second aspect of the present invention there is provided a mutant BACE 
protein which is truncated at the N-terminal up to and including R42, R45, G55, R56 or 
20 R57. In a preferred aspect, when the protein is truncated up and including R56 the residue 
at position 57 is not arginine. It may for example be lysine. 

In a third aspect the invention provides a mutant BACE protein selected from: (a) SEQ ID 
6; (b) SEQ ID 8; (c) SEQ ID 10; (d) SEQ ID 12; (e) SEQ ID 14; (f) SEQ ID 16; (g) SEQ 
ID 18; (h) SEQ ID 19; (i) SEQ ID 20; Q) SEQ ID 21. 

25 In another aspect, the invention contemplates a nucleic acid (e.g. DNA or RNA) sequence 
encoding the BACE protein of the invention, as well as the complementary nucleic acid 
sequence counterpart. 

The nucleic acids of the invention may be isolated, or may be present in the context of a 
vector or host cell. Thus, in another aspect, the invention contemplates a vector comprising 
30 the nucleic acid of the invention. 
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The nature of the vector of the mvention is not critical to the invention. Any suitable vector 
may be used, including expression vectors, plasmid, virus, bacteriophage, transposon, 
minichromosome, liposome or mechanical carrier. 

The expression vectors of the invention are DNA constructs suitable for expressing DNA 
5 which encodes the desired peptide and which may include: (a) a regulatory element (e.g. a 
promoter, operator, activator, repressor and/or enhancer), (b) a structural or coding sequence 
which is transcribed into mRNA and (c) appropriate transcription, translation, initiation and 
termination sequences. They may also contain sequence encoding any of various tags (e.g. 
to facilitate subsequent purification of the expressed protein, such as affinity (e.g. His tags). 

10 Particularly preferred are vectors which comprise an expression element or elements 

operably linked to the DNA of the invention to provide for expression thereof at suitable 
levels. Any of a wide variety of expression elements may be used, and the expression 
element or elements may for example be selected from promoters, enhancers, ribosome 
binding sites, operators and activating sequences. Such egression elements may comprise 

15 an enhancer, and for example may be regulatable, for example being inducible (via the 
addition of an inducer). 

The vector may further comprise a positive selectable marker and/or a negative selectable 
marker. The use of a positive selectable marker facilitates the selection and/or identification 
of cells containing the vector. 

20 In another aspect, the invention contemplates a host cell comprising the vector of the 

invention. The nucleic acid of the invention may be intrdoduced into the host cell by any of 
a large number of convenient methods, including calcium phosphate transfection, DEAE- 
Dextran mediated transfection, electroporation or any other method known in the art. 

Any suitable host cell may be used, including prokaryotic host cells (such as Escherichia 
25 colly Streptomyces spp. and Bacillus subtilis) and eukaryotic host cells. Suitable eukaryotic 
host cells include insect cells (e.g. using the baculovirus expression system), mammalian 
cells, fungal (e.g. yeast) cells and plant cells. Preferred mammalian cells are animal cells 
such as CHO, COS, C 127, 3T3, HeLa, HEK 293, NIH 3T3, BHK and Bowes melanoma 
(particularly preferred being CHO-Kl, COS7, Yl adrenal and carcinoma cells). 
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Cell-jfree translation systems can also be used to produce the peptides of the invention. 
Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts 
are described in Sambrook et al.. Molecular Cloning: A Laboratory Manual, Second 
Edition, Cold Spring Harbor, N.Y., (1989). 

5 Prokaryotic host cells are preferred in circumstances where the BACE protein is required in 
an unglycosylated state. 

According to another aspect of the invention there is provided a process for producing the 
BACE protein of the invention comprising the steps of: (a) culturing the host ceil of the 
invention under conditions suitable for expression of the BACE protein; and optionally (b) 
10 isolating the expressed recombinant BACE protein. 

In a further aspect the invention provides a method of making BACE protein which 
comprises proteolytically cleaving a BACE protein which lacks one of more proteolytic 
cleavage sites as described above, the cleavage desirably occurring at (and including) one of 
position 42, 45, 55, 56 or 57, preferably 42, 56 or 57. Clostripain, or another protease 
15 which recognises the same cleavage site as clostripain, may be used. 

Thus the resulting BACE protein of this aspect of invention will be a protein whose N- 
tenninal corresponds to 45, 48, 58, 59 or 60 of SEQ ID NO:2, and whose C-terminal region 
extends to and includes at least 398 of SEQ ID NO:2. Preferably the C-terminal region 
terminates at a residue between a point corresponding to and including 398 up to and 
20 including 455. This BACE protein may additionally comprise a C-terminal tag, such as a 
tag comprising firom 5 to 15 residues, such as a his tag or the like. 

In another aspect of the invention there is provided a process for producing refolded 
recombinant BACE protein comprising the steps of: (a) solubilising the recombinant 
BACE; (b) diluting the solubilised BACE into an aqueous buffer containing sulfobetaine 
25 (for example at a concentration of 10 to 50 mM, for example 10 mM); and (c) maintaining 
the diluted solution at low temperature (for example, 3 to 6'C) and at high pH (e.g. 9 to 
10,5) for at least 2 weeks (typically 3 weeks, more typically 4 weeks). 

In another aspect the invention provides a process for producing a crystal of BACE 
comprising the step of growing the crystal by vapour difiusion using a reservoir buffer that 
30 contains 18-26 % PEG 5000 MME (for example, 20-24 % PEG 5000 MME, e.g. 20-22.5 % 
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PEG 5000 MME), 180-220 mM (e.g. 200 mM) ammonium iodide and 180-22- mM (e.g. 
200 mM) txi-sodium citrate (pH 6.4-6.6). In a further aspect the reservoir buffer may 
additionally comprise from 0 to 5% (v/v) glycerol, for example 2.5% v/v. 

In another aspect the invention provides various BACE crystals, including a crystal of 
5 BACE having a hexagonal space group P6i22 (and optionally having unit cell dimensions 
of a=b=103.2 A, c=169.1 A, a=p=60^ 7=120*^, and aunit cell variability of 5% in all 
dimensions); a crystal of BACE having a resolution better than 3 A (for example, better 
than 2.5 A, e.g. better than 1 .8 A), and a crystal of BACE comprising a structure defined by 
all or a portion of the co-ordinates of Table 1. 

1 0 hi another aspect the invention provides a three-dimensional representation of BACE or of a 
portion of BACE, which representation comprises all or a portion of the coordinates of 
Table 1. The representation is prefembly a BACE model. 

The invention also contemplates a three-dimensional representation of a compound which 
fits the BACE model of the invention. 

15 The invention also contemplates a computer-based method for the analysis of the interaction 
of a molecular structure with a BACE structure of the invention, which comprises: (a) 
providing a BACE model; (b) providing a molecular structure to be fitted to said BACE 
model; and (c) fitting the molecular structure to the BACE model to produce a compound 
model. 

20 hi another aspect the invention provides a computer-based method for the analysis of the 
interaction of a molecular structure with a BACE structure of the invention, which 
comprises: (a) providing the structure of a BACE as defined by the coordinates of Table 1; 
(b) providing a molecular structure to be fitted to said BACE structure; and (c) fitting the 
molecular structure to the BACE structure of Table 1 . 

25 In another aspect the invention provides a computer-based method for the analysis of 

molecular structures which comprises: (a) providing the coordinates of at least two atoms of 
a BACE structure as defined in Table 1 ("selected coordinates"); (b) providing the structure 
of a molecular structure to be fitted to the selected coordinates; and (c) fitting the structure 
to the selected coordinates of the BACE structure. 



wo 2004/01 1641 PCT/GB2003/003200 

11 

In another aspect the invention provides a computer-based method of rational drug design 
comprising comprising: (a) providing the coordinates of at least two atoms of a BACE 
structure as defined in Table 1 ("selected coordinates"); (b) providing the structures of a 
plurality of molecular firagments; (c) fitting the structure of each of the molecular fi-agments 
5 to the selected coordinates; and (d) assembling the molecular firagments into a single 
molecule to form a candidate modulator molecule. 

In another aspect the invention provides a method for identifying a candidate modulator 
(e.g. candidate inhibitor) of BACE comprising the steps of: (a) emplo3dng a three- 
dimensional structure of BACE, at least one sub-domain thereof, or a plurality of atoms 
10 thereof, to characterise at least one BACE binding cavity, the three-dimensional structure 
being defined by atomic coordinate data according to Table 1; and (b) identifying the 
candidate modulator by designing or selecting a compound for interaction with the binding 
cavity. 

In another aspect the invention provides a method for identifying an agent compound (e.g. 
15 an inhibitor) which modulates BACE activity, comprising the steps of: (a) employing three- 
dimensional atomic coordinate data according to Table 1 to characterise at least one (e.g. a 
plurality of) BACE binding site(s); (b) providing the structure of a candidate agent 
compound; (c) fitting the candidate agent compoimd to the binding sites; and (d) selecting 
the candidate agent compound. 

20 In another aspect the invention provides a method of assessing the ability of a candidate 
modulator to interact with BACE which comprises the steps of: (a) obtaining or 
synthesising said candidate modulator; (b) forming a crystallized complex of BACE and 
said candidate modulator; and (c) analysing said complex by X-ray crystallography or NMR 
spectroscopy to determine the ability of said candidate modulator to interact with BACE. 

25 In another aspect the invention provides a method for determining the structure of a 

compoimd boimd to BACE, said method comprising: (a) mixing BACE with the compound 
to form a B ACE-compound complex; (b) crystallizing the BACE-compoxmd complex; and 
(c) determining the structure of said BACE-compound(s) complex by reference to the data 
of Table 1, 

30 In another aspect the invention provides a method for determining the structure of a 

compound boimd to BACE, said method comprising: (a) providing a crystal of BACE; (b) 
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soaking the crystal with one or more compound(s) to form a complex; and (c) determining 
the structure of the complex by emplojdng the data of Table 1. 

In another aspect the invention provides a method of detemiining the three dimensional 
structure of a BACE homologue or analogue of imknown structure, the method comprising 
5 the steps of: (a) aligning a representation of an amino acid sequence of the BACE 

homologue or analogue with the amino acid sequence of the BACE of Table 1 to match 
homologous regions of the amino acid sequences; (b) modelling the structure of the 
matched homologous regions of said target BACE of unknown structure on the 
corresponding regions of the BACE structure as defined by Table 1 ; and (c) determining a 
10 conformation for the BACE homologue or analogue which substantially preserves the 
structure of said matched homologous regions. 

In another aspect the invention provides a method of providing data for generating 
structvires and/or performing rational drug design for BACE, BACE homologues or 
analogues, complexes of BACE with a potential modulator, or complexes of BACE 
homologues or analogues with potential modulators, the method comprising: (i) establishing 
communication with a remote device containing computer-readable data comprising at least 
one of: (a) atomic coordinate data according to Table 1, said data defining the three- 
dimensional structure of BACE, at least one sub-domain of the three-dimensional structure 
of BACE, or the coordinates of a plurality of atoms of BACE; (b) structure factor data for 
BACE, said structure factor data being derivable firom the atomic coordinate data of Table 
1; (c) atomic coordinate data of a target BACE homologue or analogue generated by 
homology modelling of the target based on the data of Table 1; (d) atomic coordinate data 
of a protein generated by interpreting X-ray crystallographic data or NMR data by reference 
to the data of Table 1 ; and (e) structure factor data derivable from the atomic coordinate 
data of (c) or (d); and (ii) receiving said computer-readable data from said remote device. 

In another aspect the invention provides a computer system containing one or more of: (a) 
atomic coordinate data according to Table 1, said data defining the three-dimensional 
structure of BACE or at least selected coordinates thereof; (b) structure factor data (where a 
structure factor comprises the amplitude and phase of the diffracted wave) for BACE, said 
30 structure &ctor data being derivable from the atomic coordinate data of Table 1 ; (c) atomic 
coordinate data of a target BACE protein generated by homology modelling of the target 
based on the data of Table 1; (d) atomic coordinate data of a target BACE protein generated 



20 
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by interpreting X-ray crystallographic data or NMR data by reference to the data of Table 1 ; 
or (e) structure factor data derivable from the atomic coordinate data of (c) or (d). 

In another aspect the invention provides a computer-readable storage medium, comprising a 
data storage material encoded with computer readable data, wherein the data are defined by 

5 all or a portion of the structure coordinates of B ACE of Table 1, or a homologue of B ACE, 
wherein said homologue comprises backbone atoms that have a root mean square deviation 
firom the Ca or backbone atoms (nitrogen-carboiia-carbon) of Table 1 of less than 2.0 A, 
preferably less than 1 .5 A, more preferably less than 1 .0 A, even more preferably less than 
0.74 A, even more preferably less than 0.72 A and most preferably less than 0.5 A when 

10 superimposed on the coordinates provided in Table 1 for the residue backbone atoms. 

In another aspect the invention provides a computer-readable data storage medium 
comprising a data storage material encoded with a first set of computer-readable data 
comprising a Fourier transform of at least a portion (e.g. selected coordinates as defined 
herein) of the structural coordinates for BACE according to Table 1; which, when combined 
15 with a second set of machine readable data comprising an X-ray diffraction pattern of a 
molecule or molecular complex of imknown structure, using a machine programmed with 
the instructions for using said first set of data and said second set of data, can determine at 
least a portion of the structure coordinates corresponding to the second set of machine 
readable data. 

20 In another aspect the invention provides a computer readable medium with at least one of: 
(a) atomic coordinate data according to Table 1 recorded thereon, said data defining the 
three-dimensional structure of BACE, or at least selected coordinates thereof; (b) structure 
factor data for BACE recorded thereon, the stmcture factor data being derivable from the 
atomic coordinate data of Table 1; (c) atomic coordinate data of a target BACE protein 

25 generated by homology modelling of the target based on the data of Table 1 ; (d) atomic 
coordinate data of a BACE-ligand complex or a BACE homologue or analogue generated 
by interpreting X-ray crystallographic data or NMR data by reference to the data of Table 1; 
and (e) structure factor data derivable from the atomic coordinate data of (c) or (d). 

In another aspect the inventiou provides a method for determining the structure of a protein, 
30 which method comprises; providing the co-ordinates of Table 1, and either (a) positioning 
the co-ordinates in the crystal imit cell of said protein so as to provide a structure for said 
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protein or (b) assigning NMR spectra Peaks of said protein by manipulating the coordinates 
of Table L 

In another aspect the invention contemplates BACE modulator molecules, medicaments, 
pharmaceutical compositions and drugs obtainable by, or obtained by, the processes and 
5 methods of flie invention, and to methods of therapy (e.g. the treatment of Alzheimer's 
disease) using such products. 

It is to be understood that, except where explicitly stated otherwise, references herein to 
'"BACE protein" or "BACE peptide", "mutant BACE protein" or "mutant BACE peptide" 
and to "BACE protein" or "BACE peptide", as well as references to any of the foregoing 
10 which are further defined inter alia by reference to one or more specific amino acid 
sequences, are intended to cover BACE homologues, allelic forms, species variants, 
derivatives and muteins thereof (as defined below). 

Thus, references to mutant BACE proteins having particular amino acid sequences may 
optionally be interpreted to cover the corresponding homologues, allelic forms, species 
15 variants, derivatives and muteins (as defined below) of that particular BACE amino acid 
sequence. 

Definitions 

Where used herein and unless specifically indicated otherwise, the following terms are 
intended to have the following meanings in addition to any broader (or narrower) meanings 
20 the terms might enjoy in the art: 

The term "isolated" is used herein to indicate that the isolated moiety (e.g. peptide or 
nucleic acid) exists in a physical milieu distinct from that in which it occurs in nature. For 
example, the isolated peptide may be substantially isolated with respect to the complex 
cellular milieu in which it naturally occurs. The absolute level of purity is not critical, and 
25 those skilled in the art can readily determine appropriate levels of purity according to the 
use to which the peptide is to be put. The term "isolating" when used a step in a process is 
to be interpreted accordingly. 

In many circumstances, the isolated moiety will form part of a composition (for example a 
30 more or less crude extract containing many other molecules and substances), buffer system. 
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matrix or excipient, which may for example contain other components (including proteins, 
such as albumin). 

In other circumstances, the isolated moiety may be pxirified to essential homogeneity, for 
example as determined by PAGE or colimm chromatography (for example HPLC or mass 
5 spectrometry). In preferred embodiments, the isolated peptide or nucleic acid of the 
invention is essentially ttie sole peptide or nucleic acid in a given composition. 

The proteins and nucleic acids of the invention need not be isolated in the sense defined 
above, however. For example, more or less crude culture supematants (e.g. "spent" 
medium) may contain sufficient concentrations of the proteins or nucleic acids of the 

10 invention for use in several applications. Preferably, such supematants are firactionated 
and/or extracted, but in many circumstances they may be used without pretreatment. They 
are preferably derived from spent media used to culture the host cells of the invention (for 
example, the bacterial sources described infra). The supematants are preferably sterile. They 
may be treated in various ways, for example by concentration, filtration, centrifugation, 

15 spray drying, dialysis and/or lyophilisation. Conveniently, the culture supematants are 
simply centrifuged to remove cells/cell debris and filtered. 

The term "pharmaceutical composition" is used herein to define a solid or liquid 
composition in a form, concentration and level of purity suitable for administration to a 
patient (e.g. a hiunan or animal patient) upon which administration it can elicit the desired 
20 physiological changes. 

The term "recombinant" as applied to the proteins of the invention is used herein to define a 
protein that has been produced by that body of techniques collectively known as 
"recombinant DNA technology" (for example, using the nucleic acid, vectors and or host 
cells described herein). 

25 The term "synthetic" as applied to the peptides of the invention is used herein to define a 
peptide that has been chemically synthesised in vitro (for example by any of the 
commercially available solid-phase peptide-synthesis systems). 

As used herein in relation to the vectors of the mvention, the term "operably linked" refers 
to a condition in which portions of a linear nucleic acid sequence are capable of influencing 
30 the activity of other portions of the same linear nucleic acid sequence. For example, DNA 
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for a signal peptide (secretory leader) is operably linked to DNA for a polypeptide if it is 
expressed as a precursor which participates in the secretion of the polypeptide; a promoter is 
operably linked to a coding sequence if it controls the transcription of the sequence; a 
ribosome binding site is operably linked to a coding sequence if it is positioned in the 
5 correct reading-frame so as to permit translation. 

By "apo-structure" we mean the three-dimensional structure of the protein that contains no 
ligand, e.g. substrate or product or cofactor or inhibitor i.e. the active site of the protein is 
empty. 

hi the following by "binding site" or "binding cavity*' we mean a site (such as an atom, a 
10 functional group of an amino acid residue or a plurality of such atoms and/or groups) in a 
BACE binding cavity, which may bind to an agent compound such as a candidate inhibitor. 
Depending on the particular molecule in the cavity, sites may exhibit attractive or repulsive 
binding interactions, brought about by charge, steric considerations and the like. 

Binding sites are sites within a macromolecule, or on its surface, at which ligands can bind. 

15 Examples are the catalytic or active site of an enzyme (the site on an enzyme at which the 
amino acid residues involved in catalysing the enzymatic reaction are located), allosteric 
binding sites (ligand binding sites distinct from the catalytic site, but which can modulate 
enzymatic activity upon ligand binding), cofactor binding sites (sites involved in 
binding/co-ordinating cofactors e.g. metal ions), or substrate binding sites (the ligand 

20 binding sites on a protein at which the substrates for the enzymatic reaction bind). There 
are also sites of protein-protein interaction. 

In the following by "active site" we mean a site (such as an atom, a functional group of an 
amino acid residue or a plurality of such atoms and/or groups) in a BACE binding cavity, 
which is involved in catalysis. 

25 By "fitting", is meant determining by automatic, or semi-automatic means, interactions 
between one or more atoms of a candidate molecule and at least one atom of a BACE 
structure of the invention, and calculating the extent to which such interactions are stable. 
Interactions include attraction and repulsion, brought about by charge, steric considerations 
and the like. Various computer-based methods for fitting are described further herein. 
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By "root mean square deviation" we mean the square root of the arithmetic mean of the 
squares of the deviations from the mean. 

By a "computer system" we mean the hardware means, software means and data storage 
means used to analyse atomic coordinate data. The minimum hardware means of the 
5 computer-based systems of the present invention typically comprises a central processing 
unit (CPU), input means, output means and data storage means. Desirably a monitor is 
provided to visualise stracture data. The data storage means may be RAM or means for 
accessing computer readable media of the invention. Examples of such systems are 
microcomputer workstations available from Silicon Graphics Incorporated and Sun 
10 Microsystems running Unix based, Windows NT or IBM OS/2 operating systems. 

By "computer readable media" we mean any medium or media, which can be read and 
accessed directly by a computer e.g. so that the media is smtable for use in the above- 
mentioned computer system. Such media include, but are not limited to: magnetic storage 
media such as floppy discs, hard disc storage medium and magnetic tape; optical storage 
15 media such as optical discs or CD-ROM; electrical storage media such as RAM and ROM; 
and hybrids of these categories such as magnetic/optical storage media. 

The term "homologue" is used herein in two distinct senses. It is used sensu stricto to define 
proteins that share a common ancestor. In this sense it covers orthologues (species variants 
which have diverged in different organisms foUoAving a speciation event) and paralogues 
20 (variants which have diverged within the same organism after a gene duplication event). 
Thus, there is a direct evolutionary relationship between such homologues and this may be 
reflected in structural and/or frmctional similarities. For example, orthologues may perform 
the same role in each organism in which they are found, while paralogues may perform 
functionally related (but distinct) roles within the same organism. 

25 The term is also used herein sensu lato to define proteins which are to some extent 
structurally similar (i.e. not necessarily evolutionary related and/or structurally and 
functionally equivalent). In this sense, homology is recognised on the basis of purely 
structural criteria by the presence of amino acid sequence identities and/or conservative 
amino acid changes and/or similar secondary, tertiary or quaternary structures. 

30 The term "analogue" is used herein to define proteins with similar functions and/or 

structures and which are not necessarily evolutionary related. Protein analogues which 
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share function but which have no or little structural similarities are likely to have arisen by 
convergent evolution. Conversely, protein analogues which share structural similarities but 
which exhibit few or no functional similarities are likely to have arisen by divergent 
evolution. Protein analogues may be identified, for example, by screening a library of 
5 proteins to detect those with similar function(s) but different physical properties, or by 
screening for proteins which share structural features but not necessarily any functions (e.g. 
by immunological screening). 

The term "equivalent" is used herein to define those protein analogues which exhibit 
substantially the same function(s) and which share at least some structural features (e.g. 

10 functional domains), but which have not evolved firom a common ancestor. Such 

equivalents are typically synthetic proteins (see below) and may be generated, for example, 
by identifying sequences of functional importance (e.g. by identifying conserved or 
canonical sequences, functional domains or by mutagenesis followed by functional assay), 
selecting an amino acid sequence on that basis and then synthesising a peptide based on the 

15 selected amino acid sequence. Such synthesis can be achieved by any of many different 
methods known in the art, including solid phase peptide synthesis (to generate synthetic 
peptides) and the assembly (and subsequent cloning) of oligonucleotides. Some synthetic 
protein analogues may be chimaeras (see below), and such equivalents can be designed and 
assembled for example by concatenation of two or more different structural and/or 

20 functional peptide domains firom different proteins using recombinant DNA techniques (see 
below). 

The BACE protein homologues of the invention therefore include proteins and peptides 
having at least 75%, 80%, 85%, 90%, 95%, 97%, 98% or 99% sequence identity with the 
reference protein, and include truncated forms of the BACE proteins of the invention. Such 
25 truncates are preferably at least 25%, 35%, 50% or 75% of the length of the corresponding 
specifically exemplified proteins and may have at least 60% sequence identity (more 
preferably, at least 75%, 80%, 85%, 90% , 95%, 97%, 98% or 99% sequence identity) with 
that specifically exemplified protein. 

Particularly preferred homologues are truncates that contain a segment preferably 
30 comprising at least 8, 15, 20 or 30 contiguous amino acids that share at least 75%, 80%, 
85%, 90% , 95%, 97%, 98% or 99% sequence identity with that specifically exemplified 
protein. 
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A "conservative amino acid change" is one in which the amino acid residue is replaced with 
an amino acid residue having a similar side chain. Families of amino acid residues having 
similar side chains have heen defined in the art. These families include amino acids vsdth 
basic side chains (e.g. lysine, arginine and histidine), acidic side chains (e.g. aspartic acid 
5 and glutamic acid), non-charged polar side chains (e.g. glycine, asparagine, glutamine, 

serine, threonine, tyrosine and cysteine), non-polar side chains (e.g. alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine and tryptophan), beta-branched side chains 
(e.g. threonine, valine and isoleucine), and aromatic side chains (e.g. tyrosine, 
phenylalanine, tryptophan and histidine). 

10 Thus, references herein to proteins and peptides that are to some defined extent "identical" 
(or which share a defined extent of "identity") with a reference protein or peptide may also 
optionally be interpreted to include proteins and peptides in which conservative amino acid 
changes are disregarded so that the original amino acid and its changed coimterpart are 
regarded as identical for the purposes of sequence comparisons. 

15 The term "allelic form" is used herein to define a naturally-occurring alternative forms of 
the sequence present in the BACE protein which reflect naturally-occurring differences in 
the BACE gene pool. Preferably, allelic variants of the proteins of the invention have at 
least 60% sequence identity (more preferably, at least 75%, 80%, 85%, 90% or 95% 
sequence identity) with the corresponding specifically exemplified BACE protein, where 

20 sequence identity is determined by comparing the nucleotide sequences of the 

polynucleotides when aligned so as to maximize overlap and identity while minimizing 
sequence gaps. 

The term "species variant" (or orthologue) is used herein to define the corresponding 
protein firom a different organism. Thus, species variants share a direct evolutionary 
25 relationship. 

The term "derivative" as applied herein to the BACE proteins of the invention is used to 
define proteins which are modified versions of the specifically exemplified proteins of the 
invention. Such derivatives may include fiision proteins, in which the proteins of the 
30 invention have been fiised to one or more different proteins, peptides or amino acid tags (for 
example an antibody or a protein domain conferring a biochemical activity, to act as a label, 
^ or to facilitate purification). Particularly preferred are derivatives in which the peptides are 
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modified by a polyHis (6xHis) tag to facilitate purification of the peptide derivative on Ni 
agarose beads. 

The derivatives may also be products of synthetic processes that use a peptide of the 
invention as a starting material or reactant. 

5 The term "mutein" is used herein to define proteins that are mutant forms of the BACE 
proteins of the invention, i.e. proteins in which one or more amino acids have been added, 
altered, deleted, replaced, inserted or substituted. Thus, the terms "BACE mutein" and 
"mutant BACE protein" are used interchangeably herein. The muteins/mutant BACE 
proteins of the invention therefore include firagments, tnmcates and fusion proteins and 
10 peptides (e.g. comprising fused immunoglobulin, receptor, tag, label or enzyme moieties). 

The muteins of the invention therefore include truncated forms of the BACE proteins of the 
invention. Such truncates are preferably least 25%, 35%, 50% or 75% of the length of the 
corresponding specifically exemplified BACE protein and may have at least 60% sequence 
identity (more preferably, at least 75%, 80%, 85%, 90% or 95% sequence identity) with that 
15 specifically exemplified protein. 

Particularly preferred are truncates that contain a segment preferably comprising at least 8, 
15, 20 or 30 contiguous amino acids that share at least 75%, 80%, 85%, 90% or 95% 
sequence identity with that specifically exemplified protein. 

For the purposes of the present invention, sequence identity is determined by comparing the 
20 amino acid sequences of the proteins when aligned so as to maximize overlap and identity 
while minimizing sequence gaps. In particular, sequence identity may be determined using 
any of a number of mathematical algorithms. A nonlimiting example of a mathematical 
algorithm used for comparison of two sequences is the algorithm of Karlin and Altschul 
(1990) Proc. Natl. Acad. Sci. USA 87: 2264-2268, modified as in Karlin and Altschul 
25 (1993) Proc. Natl. Acad. Sci. USA 90: 5873-5877. 

Another example of a mathematical algorithm used for comparison of sequences is the 
algorithm of Myers and Miller (1988) CABIOS 4: 1 1-17. Such an algorithm is incorporated 
into the ALIGN program (version 2.0) which is part of flie GCG sequence alignment 
software package. When utilizing the ALIGN program for comparing amino acid 
30 sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 
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4 can be used. Yet another useful algorithm for identifying regions of local sequence 
similarity and alignment is the FASTA algorithm as described in Pearson and Lipman 
(1988) Proc. Natl. Acad. Sci. USA 85: 2444-2448. 

Preferred for use according to the present invention is the WU-BLAST (Washington 
5 University BLAST) version 2.0 software. WU-BLAST version 2.0 executable programs for 
several UNIX platforms can be downloaded from ://blast. wustl. edu^last/executables. 
This program is based on WU-BLAST version 1 .4, which in turn is based on the public 
domain NCBI-BLAST version 1 .4 (Altschul and Gish, 1996, Local alignment statistics, 
Doolittle ed.. Methods in Enzymology 266: 460-480; Altschul et al., 1990, Basic local 
10 alignment search tool, Journal of Molecular Biology 215: 403-410; Gish and States, 1993, 
Identification of protein coding regions by database similarity search. Nature Genetics 3: 
266-272; Karlin and Altschul, 1993, AppUcations and statistics for multiple high-scoring 
segments in molecular sequences, Proc. Natl. Acad. Sci. USA 90: 5873-5877; all of which 
are incorporated by reference herein). 

15 Li all search programs in the suite the gapped alignment routines are integral to the database 
search itself. Gapping can be tumed off if desired. The default penalty (Q) for a gap of 
length one is Q=9 for proteins and BLASTP, and Q=10 for BLASTN, but may be changed 
to any integer. The default per-residue penalty for extending a gap (R) is R=2 for proteins 
and BLASTP, and R=10 for BLASTN, but may be changed to any integer. Any 

20 combination of values for Q and R can be used in order to align sequences so as to 

maximize overlap and identity while minimizing sequence gaps. The default amino acid 
comparison matrix is BLOSUM62, but other amino acid comparison matrices such as PAM 
can be utiUzed. 

25 The muteins of the invention also include peptides in which mutations have been introduced 
which effectively promote or impair one or more activities of the protein, for example 
mutations which promote or impair the function of a receptor, a recognition sequence or an 
effector binding site. 

30 Muteins may be produced by any convenient method. Conveniently, site-directed 

mutagenesis with mutagenic oligonucleotides may be employed using a double stranded 
template (pBluescript KS n construct containing nucleic acid encoding the BACE protein), 
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(e.g. Chameleon"^ or QiiikChange'^'^ - Stratagene™) or cassette mutagenesis methods my be 
employed. After verifying each mutant derivative by sequencing, the mutated gene is 
excised and inserted into a suitable vector so that the modified protein can be over- 
expressed and purified. 

5 Brief Description of the Drawings 

Table 1, provides the coordinates of the BACE structure. The numbering of the residues 
used in this Table (see Section (D) below) correspond to the numbering of used by Hong et 
al, ibid. Elsewhere - unless indicated to the contrary - in the specification the numbering of 
the SwissProt database entry P56817 is used. Residue 1 of Table 1 corresponds to 62 of 
10 SwissProt P56817, and residue 385 corresponds to 446 of SwissProt P56817. In the 

sequence listing below, the SwissProt P56817 residues 14-453 are shown as 16-455 of SEQ 
IDNO:2. 

Figure 1 represents the packing arrangements of the BACE monomers within the P6i22 
crystal lattice. 

15 Figure 2 shows the superposition of BACE in complex with OM99-2 (IFKN), in black, 
with BACE, of the invention, in the absence of ligand (grey). The position of OM99-2 is 
defined by a stick representation of the inhibitor. 

Detailed Description of the Invention 

A. Construct desieai 

20 BACE protease is expressed, at high levels, as insoluble inclusion bodies in bacterial cells. 
To prepare fimctional protein appropriate for enzyme assay and structural studies these 
inclusion bodies are solubilised using denaturants and the slow removal of these denaturants 
results in the formation of the correct tertiary structure. In addition BACE is expressed as a 
pro-sequence and requires activation by a protease before it is fiilly fimctional. 

25 One of the problems of the techniques described in the art (Tang et at) for isolation of 
BACE firom inclusion bodies is the generation of a mixture of products firom the 
uncontrolled cleavage process. Choppa et al describe the isolation of BACE firom 
mammalian cells and the subsequent cleavage with protease, which also gives a mixture of 
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protein species. Thus there is a need in the art for a method of generating active BACE as a 
homogenous species. 

A further problem with the prior art techniques is the low yield of crystallisable material 
obtained. The inventors surprisingly foimd that the present invention results in a high yield 
5 from bacterial cells, in particular E. coll 

The inventors utilized clostripain as an activating protease to perform this cleavage in a 
controlled manner but this produced multiple species of BACE, as determined by mass 
spectrometry. In order to obtain a uniform homogenous protein after activation, a number 
of different constructs were produced. These constructs focused on the mutation of two of 
10 the clostripain cleavage sites (R56 and R57). 

The sequences of the invention were designed to achieve a single cleavage point upon 
activation by clostripain, as activation of wild type sequence in this way resulted in a non- 
crystallisable protein with heterogeneous N termini. 

The BACE constructs of the invention contain successful modifications of the BACE 
15 sequence to allow generation of homogeneous protein product from the use of clostripain. 
The sequence of the invention contains substitution for another amino acid residue or 
deletion of the arginine 56 and/or arginine 57 (niimbering based on wild type full length 
sequence, SWISS_PROT P56817). In a preferred aspect of the invention this is a conserved 
substitution. Conservative amino acid substitutions are well known in the art, and include 
20 substitutions made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity and/or the amphipathic nature of the amino acid residues involved. For 
example, positively charged amino acids include lysine and arginine and histidine. In a 
preferred aspect the mutation introduced is substitution of arginine to lysine at position 56 
and/or 57, more preferably 56 and 57. This results in, as oppose to the wild type, the 
25 production of a single species of activated protein upon limited digest with clostripain. 
Clostripain cleavage occurs at a single site and is thus specific and generates a single 
species in minutes. 



The advantage of these mutations is that they allow the controlled cleavage at arginine 
residue 42 and hence provides a single N-terminus. 
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This controlled cleavage thiis provides a means to produce a substantially homogeneous 
composition of a B ACE protein of the invention. By substantially homogeneous, it is meant 
that at least 95%, preferably at least 98% and more preferably at least 99% of the BACE 
protein in the composition has the same N-temiinus. The N-terminus may be selected from 
5 residues 43 (i.e. by cleavage at 42), 46, 56, 57 or 58, preferably from 43, 56, 57 or 58, more 
preferably 43, 56 or 57. 

These mutations can be introduced onto any sequence of BACE by site-directed 
mutagenesis techniques, to facilitate the generation of homogeneous material for structural 
or activity studies. Thus proteins of the invention are BACE proteins with residues 56 
10 and/or 57 either mutated or deleted. Proteins of the invention also include BACE mutants 
described below in section (C). 

The invention is exemplified by several constructs (SEQ ID 5-18). These were built based 
on the wild type sequence (BACE WT, SEQ ID 2) where R56 and/or R57 were mutated to 
K or deleted. These were BACE WT R56KR57K (SEQ ID 6), BACE WT R57K (SEQ ID 

15 8), BACE WT R57del (SEQ ID 10). This was also performed on the BACE construct 

BACE N->Q to give BACE N->Q R56KR57K (SEQ ID 12), BACE N->Q R57K (SEQ ID 
16), BACE N->Q R57del (SEQ ID 18). The BACE N">Q construct contams 4 additional 
mutations of asparagines to glutamine and a C-terminal His tag as well as the arginine 
mutations. BACE N->Q without the His tag was mutated at 56 and 57 to give BACE N->Q 

20 R56K R57K no His (SEQ ID 14). 

SEQ ID 19 is the activated from of SEQ ID 6, SEQ ID 21 the activated form of SEQ ID 12 
and SEQ ID 20 the activated form of SEQ ID 14, i.e. the fomi in which the protein is 
crystallized. 

The three BACE constructs BACE WT R56KR57K, BACE N->Q R56KR57K, and BACE 
25 N->Q R56KR57K no His gave higher expression levels. 

Thus the invention concerns any BACE proteins with one or more of: a mutation at 56, and 
mutation at 57, or a deletion at 56 or a deletion at 57, but preferably 56 and 57 mutated, and 
crystals thereof i.e. any BACE protein comprising residues 56-396 of BACE (based on 
numbering of SwissProt P56817) and containing these mutations. 
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B. Refolding protocol 

The protein was expressed in E, coli as inclusion bodies, as outlined above. In an 
improvement of existing techniques BACE isolated from inclusion bodies was refolded by 
the use of high pH, a sulfobetaine refolding agent, and a longer duration at high pH. This 
5 refolding protocol increased the yield of refolded protein obtained and also gave high and 
reproducible yields of refolded BACE suitable for crystallisation. 

The use of high pH in refolding (Burton et el, 1989) and of sulfobetaines as solubilising 
molecules in folding experiments (Goldberg et a/, 1996) has previously been described. 
Here we describe the use of a combination of these technologies to give an unprecedented 
10 high yield of BACE. In addition to this combination of high pH and sulfobetaine, in 

another deviation from existing protocols for refolding BACE, the pH is maintained at high 
pH for at least 2 weeks. This is in comparison to the method of Tang et al, where BACE is 
solubilised at high pH and then the pH lowered before protein recovery at least 2-3 weeks 
later, preferably 3-4 weeks later. 

15 Another aspect of the invention therefore concerns a novel method of producing soluble 
BACE proteins of the invention, utilizing a refolding protocol comprising the combined 
techniques of high pH buffer and the use of sulfobetaine, and also maintaining this high pH 
over at least two weeks. 

More specifically, a method for producing refolded recombinant BACE comprising 
20 refolding the BACE under conditions \s4iich denature and then slowly renature the enzyme 
into a soluble form wherein: (a) the BACE is solubilised using a chaotrope such as urea or 
guanidine at 8-1 OM (typically 8 M urea solution) including one or more reducing agents at a 
pH of greater than 8.0 e.g. pH 9.0-10.5; (b) the BACE is then diluted into an aqueous buffer, 
like 20 mM-Tris, pH 9.0, containing sulfobetaine, preferably 10 mM sulfobetaine, where the 
25 sulfobetaine is preferably NDSB256 (3-(benzyldimethylammonio) propanesulfonate); (c) 
the solution is maintained at low temperature, e.g. 3-6 °C typically 4 ^'C, and at high pH, 
typically approximately pH 9.0, for at least 2 weeks (typically 3 weeks, more typically 4 
weeks) before proceeding with purification. 

C. Protein Crystals. 

30 Described herein is a crystal of BACE having a hexagonal space group P6i22, and unit cell 
dimensions a=b=103.2 A, c=169.1 A , a=p=60 •y=120*'. Unit cell variability of 5% may 



wo 2004/01 1641 PCT/GB2003/003200 

26 

be observed in all dimensions. Such crystals contain one copy of B ACE in the asymmetric 
unit. 

Such a crystal may be obtained using the methods described in the accompanying examples. 

The crystal may be of the BACE protein of SEQ ID 19 although as explained earlier any 
5 homologue, allelic form, species variant, derivative or mutein (as hereinbefore defined) may 
be used. Thus, it will be understood by those of skill in the art that some variation to the 
primaiy amino acid sequence may be made without significant alteration to the resulting 
crystal structure. Such minor variations include the replacement of one or more amino 
acids, for example from 1 to 30, such as 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 amino acids by an 
10 equivalent or fewer niraiber of amino acids. 

The methodology used to provide a BACE crystal illustrated herein may be used generally 
to provide a human BACE apo crystal resolvable at a resolution of at least 3 A. 

The invention thus fiirther provides an apo BACE crystal having a resolution better than, i.e. 
numerically lower than, 2.5 A. 

15 The invention also provides a BACE crystal having a resolution better than, i.e. numerically 
lower than, 1.8 A. 

The invention also provides apo crystals of BACE resolvable to at least 2.5 A capable of 
being soaked with compound(s) to form co-complex stmctures. 

The proteins may be wdld-type proteins or variants thereof, which are modified to promote 
20 crystal formation, for example by N-terminal truncations and/or deletion of loop regions, 
which prevent crystal formation. 

The methods described herein may be used to make a BACE protein crystal, particularly of 
a BACE protein of SEQ ID 19-21, which method comprises growing a crystal by vapour 
diffusion using a reservok buffer that contains 1 8-26 % PEG 5000 MME, preferably 20-24 
25 % PEG 5000 MME, more preferably 20-22.5 % PEG 5000 MME, with 1 80-220 mM (e.g. 
200 mM) ammonium iodide and 180-220 mM (e.g. 200 mM) tri-sodium citrate (pH 6.4- 
6.6). In a preferred embodiment, this reservoir buffer may also contain from 0 to 5% 
glycerol, e.g. about 2.5% glycerol. The growing of the crystal is by vapour diffusion and is 
performed by placing an aliquot of the protein solution on a cover slip as a hanging drop 
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above a well containing the reservoir buffer. The concentration of the protein solution used 
was approximately 7 mg/ml. 

Other crystals of the invention include crystals which have selected coordinates of the 
binding pocket, wherein the amino acid residues associated vsdth those selected coordinates 
5 are located in a protein framework which holds these amino acids in a relative spatial 
configuration corresponding to the spatial configuration of those amino acids in Table 1. 
By "corresponding to", it is meant within an r.m.s.d. of less than 2.0 A, preferably less than 
1.5 A, more preferably less than 1.0 A, even more preferably less than 0.74 A, even more 
preferably less than 0.72 A and most preferably less than 0.5 A from the Ca or backbone 
10 atoms of Table 1, preferably the Ca atoms. 

Crystals of the invention also include crystals of BACE mutants (muteins). In addition, 
BACE mutants may be crystallized in co-complex with known BACE substrates or 
inhibitors or novel compounds. 

As explained herein, a mutant BACE (or BACE mutein) is a BACE protein characterized by 
15 the replacement or deletion of at least one amino acid firom the wild type BACE. Such a 
mutant may be prepared for example by site-specific mutagenesis, or incorporation of 
natural or imnatural amino acids. 

As explained herein, the present invention therefore contemplates BACE mutants (or 
muteins) as hereinbefore defined. 

20 For example, the BACE mutants may define a polypeptide which is obtained by replacing at 
least one amino acid residue in a native or synthetic BACE with a different amino acid 
residue and/or by adding and/or deleting amino acid residues within the native polypeptide 
or at the N- and/or C-terminus of a polypeptide corresponding to BACE, and which has 
substantially the same three-dimensional structure as BACE fi-om which it is derived. By 

25 having substantially the same three-dimensional structure is meant having a set of atomic 
structure co-ordinates that have a root mean square deviation (r.m.s.d.) of less than or equal 
to about 2.0 A (preferably less than 1.5 A, more preferably less than 1.0 A, even more 
preferably less than 0.74 A, even more preferably less than 0.72 A and most preferably less 
than 0.5 A) when superimposed with the atomic structure co-ordinates of the BACE from 

30 which the mutant is derived when at least about 50% to 100% of the Ca atoms of the BACE 
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are included in the superposition. A mutant may have, but need not have, enzymatic or 
catalytic activity. 

To produce homologues or mutants, amino acids present in the said protein can be replaced 
by other amino acids having similar properties, for example hydrophobicity, hydrophobic 
5 moment, antigenicity, propensity to form or break a-helical or P-sheet structures, and so, 
Substitutional variants of a protein are those in which at least one amino acid in the protein 
sequence has been removed and a different residue inserted in its place. Amino acid 
substitutions are typically of single residues but may be clustered depending on functional 
constraints e.g. at a crystal contact. Preferably amino acid substitutions will comprise 
10 conservative amino acid substitutions. Insertional amino acid variants are those in which 
one or more amino acids are introduced. This can be amino-terminal and/or carboxy- 
terminal fusion as well as intrasequence. Examples of amino-terminal and/or carboxy- 
terminal fusions are affinity tags, MBP tag, and epitope tags. 

Deletional variants are those in wrhich one or more amino acids are removed. This can be 
15 amino-terminal and/or carboxy-terminal, or in an internal region (for example a loop 
region), for example to remove or shorten that region. 

Amino acid substitutions, deletions and additions that do not significantly interfere with the 
three-dimensional structure of the BACE will depend, in part, on the region of the BACE 
where the substitution, addition or deletion occurs, hi highly variable regions of the 
20 molecule, non-conservative substitutions as well as conservative substitutions may be 
tolerated without significantly disrupting the three-dimensional structure of the molecule. 
In highly conserved regions, or regions containing significant secondary structure, 
conservative amino acid substitutions are preferred. 

As explained earlier, conservative amino acid substitutions are well known in the art, and 
25 include substitutions made on the basis of similarity in polarity, charge, solubility, 

hydrophobicity, hydrophilicity and/or the amphipathic nature of the amino acid residues 
involved. For example, negatively charged amino acids include aspartic acid and glutamic 
acid; positively charged amino acids include lysine and arginine; amino acids with 
imcharged polar head groups having similar hydrophiUcity values include the following: 
30 leucine, isoleucine, valine; glycine, alanine; asparagine, glutamine; serine, threonine; 
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phenylalanine, tyrosine. Other conservative amino acid substitutions are well known in the 
art. 

In some instances, it may be particularly advantageous or convenient to substitute, delete 
and/or add amino acid residues to a BACE binding pocket or catalytic residue in order to 
5 provide convenient cloning sites in the cDNA encoding the polypeptide, to aid in 
purification of the polypeptide, to modify compoimd binding etc. Such substitutions, 
deletions and/or additions which do not substantially alter the three dimensional structure of 
BACE will be apparent to those having skills in the art. 

It should be noted that the mutants (BACE muteins) contemplated herein need not exhibit 
10 enzymatic activity. Indeed, amino acid substitutions, additions or deletions that interfere 
with the catalytic activity of the BACE but which do not significantly alter the three- 
dimensional structure of the catalytic region are specifically contemplated by the invention. 
Such crystalline polypeptides, or the atomic structure co-ordinates obtained there from, can 
be used to identify compounds that bind to the protein. 

The crystallization of such mutants and the determination of the three-dimensional 
structures by X-ray crystallography relies on the ability of the mutant proteins to yield 
crystals that diffract at high resolution. The mutant protein could then be used to obtain 
information on compound binding through the determination of mutant protein/ligand 
complex structures, v^hich may be characterized using the BACE crystal structure of Table 
1. 

The mutations can be introduced by site-directed mutagenesis e.g. using a Stratagene 
QuikChange™ Site-Directed Mutagenesis Kit or cassette mutagenesis methods (see e.g. 
Ausubel et al., eds., Current Protocols in Molecular Biology^ John Wiley & Sons, Inc., New 
York, and Sambrook et al.. Molecular Cloning: a Laboratory Manual, 2nd ed.. Cold Spring 
25 Harbor Laboratory Press, Cold Spring Harbor, NY, (1989)). 

To the extent that the present invention relates to BACE-ligand complexes and mutant, 
homologue, allelic form, species variant, derivative, mutein and analogue proteins of 
BACE, crystals of such proteins may be formed. The skilled person would recognize that 
the conditions provided herein for crystallising BACE may be used to form such crystals. 
30 Alternatively, the skilled person would use the conditions as a basis for identifying modified 
conditions for forming the crystals. 
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Thus the aspects of the mvention relating to crystals of B ACE, may be extended to crystals 
of mutant/mutein, homologue, allelic form, species variant or derivative (as defined herein). 

D. Crystal Coordinates 

In a further aspect, the invention also provides an apo crystal structure of B ACE having the 
5 three dimensional atomic coordinates of Table 1. An advantageous feature of the structure 
defined by the atomic coordinates is that it has a high resolution of about 1 .75 A. A further 
advantageous aspect is the provision of an apo structure of B ACE, which contains no ligand 
bound, unlike those previously described in the art. This is particularly advantageous as 
ligands can then be easily soaked into the crystal to provide co-complex data without the 
10 need for removal of any ligand already present, and without the need for time-consuming 
co-crystallisation experiments. 

The BACE structure set out in Table 1 is a monomer structure. This is the first time that a 
monomer has been observed crystallographically for this protein. 

Table 1 gives atomic coordinate data for BACE. In Table 1 the third column denotes the 
15 atom type, the fourth the residue type, the fifth the chain identification, the sixth the residue 
number (the atom numbering as described in Hong et al, 2000) the seventh, eighth and ninth 
columns are the X, Y, Z coordinates respectively of the atom in question, the tenth column 
the occupancy of the atom, the eleventh the temperature factor of the atom, the twelfth the 
chain identification, and the last, thirteenth column, the atom type. 

20 Each of the tables is presented in an internally consistent format. For example, in Table 1 
the coordinates of the atoms of each amino acid residue are listed such that the backbone 
nitrogen atom is first, followed by the C-alpha backbone carbon atom, designated CA, 
followed by the carbon and oxygen of the protein backbone and finally side chain residues 
(designated according to one standard convention). Alternative file formats (e.g. such as a 

25 format consistent with that of the EBI Macromolecidar Structure Database (Hinxton, UK)) 
which may include a different ordering of these atoms, or a different designation of the side- 
chain residues, may be used or preferred by others of skill in the art. However it will be 
apparent that the use of a different file format to present or manipulate the coordinates of 
the Tables is within the scope of the present invention. 
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The coordinates of Table 1 provide a measure of atomic location in Angstroms, to 3 decimal 
places. The coordinates are a relative set of positions that define a shape in three 
dimensions, but the skilled person would understand that an entirely different set of 
coordinates having a different origin and/or axes could define a similar or identical shape. 
5 Furthermore, the skilled person would understand that varying the relative atomic positions 
of the atoms of the structure so that the root mean square deviation of the residue backbone 
atoms (i.e. the nitrogen-carbon-carbon backbone atoms of the protein amino acid residues) 
is less than 2.0 A, preferably less than 1 .5 A, more preferably less than 1 .0 A, even more 
preferably less than 0.74 A, even more preferably less than 0.72 A and most preferably less 
10 than 0.5 A when superimposed on the coordinates provided in Table 1 for the Ca atoms or 
residue backbone atoms, will generally result in a structure which is substantially the same 
as the stmcture of Table 1 in terms of both its structural characteristics and usefulness for 
structure-based analysis of BACE-interactivity molecular structures. 

Likewise the skilled person would understand that changing the number and/or positions of 
15 the water molecules and/or substrate molecules of Table 1 will not generally affect the 

usefulness of the structure for structure-based analysis of BACE-interacting structure. Thus 
for the purposes described herein as being aspects of the present invention, it is within the 
scope of the invention if: the Table 1 coordinates are transposed to a different origin and/or 
axes; the relative atomic positions of the atoms of the structure are varied so that the root 
20 mean square deviation of residue backbone atoms is less than 2.0 A, preferably less than 1 .5 
A, more preferably less than 1 .0 A, even more preferably less than 0.74 A, even more 
preferably less than 0.72 A, and most preferably less than 0.5 A when superimposed on the 
coordinates provided in Table 1 for the Ca or residue backbone atoms; and/or the number 
and/or positions of water molecules and/or substrate molecules is varied. 

25 Reference herein to the coordinate data of Table 1 and the like thus includes the coordinate 
data in which one or more individual values of the Table are varied in this way miless 
specified explicitly to the contrary. In a preferred aspect, reference herein to the coordinates 
of Table 1 or parts thereof (e.g. selected coordinates) should be taken to include coordinates 
having a root mean square deviation of less than 0.72 A, and preferably less than 0.5 A, 

30 fi-om the Ca atoms of Table 1 or corresponding parts thereof. 

By "root mean square deviation" we mean the square root of the arithmetic mean of the 
squares of the deviations fi-om the mean. 
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Protein structure similarity is routinely expressed and measured by the root mean square 
deviation (r.m.s.d.), which measures the difference in positioning in space between two sets 
of atoms. The r.m.s.d. measures distance between equivalent atoms after their optimal 
superposition. The r.m.s.d. can be calculated over all atoms, over residue backbone atoms 
5 (i.e. the nitrogen-carbon-carbon backbone atoms of the protein amino acid residues), main 
chain atoms only (i.e. the nitrogen-carbon-oxygen-carbon backbone atoms of the protein 
amino acid residues), side chain atoms only or more usually over C-alpha atoms only. For 
the purposes of this invention, the r.m.s.d. can be calculated over any of these, using any of 
the methods outlined below. 

10 Methods of comparing protein structures are discussed in Methods of Enzymology, vol 115, 
pg 397-420. The necessary least-squares algebra to calcvilate r.m.s.d. has been given by 
Rossman and Argos (J. Biol. Chem. , vol 250, pp7525 (1975)) although faster methods have 
been described by Kabsch (Acta Crystallogr., Section A, A92, 922 (1976); Acta Cryst. A34, 
827-828 (1978)), Hendrickson (Acta Crystallogr., Section A, A35, 158 (1979) and 

15 McLachan (J. Mol. Biol., vol 128, pp49 (1979). Some algorithms use an iterative procedure 
in which the one molecule is moved relative to the other, such as that described by Ferro 
and Hermans (Ferro and Hermans, Acta Crystallographic, A33, 345-347 (1977)). Other 
methods e.g. Kabsch' s algorithm locate the best fit directly. 

It is usual to consider C-alpha atoms and the rrasd can then be calculated using programs 
20 such as LSQKAB (Collaborative Computational Project 4. The CCP4 Suite: Programs for 
Protein Crystallography, Acta Crystallographica, D50, (1994), 760-763), MNYFIT (part of 
a collection of programs called COMPOSER, Sutcliffe, M.J., Haneef, I., Carney, D. and 
Blundell, T.L. (1987) Protem Engineering, 1, 377-384), MAPS (Lu, G. An Approach for 
Multiple Alignment of Protein Structures (1998, in manuscript)), QUANTA (Jones et al., 
25 Acta Crystallography A47 (1 99 1 ), 1 1 0- 1 1 9 and conunercially available from Accelerys, 

San Diego, CA), Insight (commercially available from Accelerys, San Diego, CA), Sybyl® 
(commercially available from Tripos, Inc., St Louis), O (Jones et al.. Acta 
Crystallographica, A47, (1991), 1 10-1 19), and other coordinate fitting programs. 

In, for example the programs LSQKAB and O, the user can define the residues in the two 
30 proteins that are to be paired for the purpose of the calculation. Alternatively, the pairing of 
residues can be determined by generating a sequence alignment of the tv^o proteins, 
programs for sequence alignment are discussed in more detail in Section G. The atomic 
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coordinates can then be superimposed according to this alignment and an r.m.s.d. value 
calculated. The program Sequoia (CM. Bruns, I. Hubatsch, M, RidderstrOm, B. Mannervik, 
and J. A. Tainer (1999) Human Glutathione Transferase A4-4 Crystal Structures and 
Mutagenesis Reveal the Basis of High Catalytic Efficiency with Toxic Lipid Peroxidation 

5 Products, Journal of Molecular Biology 288(3): 427-439) performs the alignment of 
homologous protein sequences, and the superposition of homologous protein atomic 
coordinates. Once aligned, the r.m.s.d. can be calculated using programs detailed above. For 
sequence identical, or highly identical, the structural alignment of proteins can be done 
manually or automatically as outlined above. Another approach would be to generate a 

10 superposition of protein atomic coordinates without considering the sequence. 

It is more normal when comparing significantly different sets of coordinates to calculate the 
r.m.s.d. value over C-alpha atoms only. It is particularly useful when analysing side chain 
movement to calculate the r.m.s.d. over all atoms and this can be done using LSQKAB and 
other programs. 

15 Varying the atomic positions of the atoms of the structure by up to about 0.5 A in a 

concerted way, preferably up to about 0.3 A in any direction will result in a structure which 
is substantially the same as the structure of Table 1 in terms of both its structural 
characteristics and utility e.g. for molecular structure-based analysis. 

Also, modifications in the BACE crystal structure due to e.g. mutations, additions, 
20 substitutions, and/or deletions of amino acid residues (including the deletion of one or more 
BACE protomers) could accoimt for variations in the BACE atomic coordinates. However, 
atomic coordinate data of BACE modified so that a ligand that boimd to one or more 
binding sites of BACE would be expected to bind to the corresponding binding sites of the 
modified BACE are, for the purposes described herein as being aspects of the present 
25 invention, also within the scope of the invention. Reference herein to the coordinates of 
Table 1 thus includes the coordinates modified in this way. Preferably, the modified 
coordinate data define at least one BACE binding cavity. 

Those of skill in the art will appreciate that in many applications of the invention, it is not 
necessary to utilise all tiie coordinates of Table 1, but merely a portion of them. The term 
30 portion is intended to define a sub-set of the coordinates, which may or may not represent 
contiguous amino acid residues in the BACE structure. For example, as described below, in 
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methods of modelling candidate compounds with BACE, selected coordinates of B ACE 
may be used, for example at least 5, preferably at least 10, more preferably at least 50 and 
even more preferably at least 100 atoms of the BACE structure. Likewise, the other 
applications of the invention described herein, including homology modelling and structure 
5 solution, and data storage and computer assisted manipulation of the coordinates, may also 
utilise all or a portion of the coordinates of Table L 

E. Homology Modelling 

The invention also provides a means for homology modelling of other proteins (referred to 
belov^ as target BACE proteins). By "homology modelling", it is meant the prediction of 
10 related BACE structures based either on X-ray crystallographic data or computer-assisted de 
novo prediction of structure, based upon manipulation of the coordinate data of Table 1. 

"Homology modelling" extends to target BACE proteins, which are analogues or 
homologues of the BACE protein whose structure has been determined in the 
accompanying examples. It also extends to BACE protein mutants of BACE protein itself 

15 The term "homologous regions" describes amino acid residues in two sequences fliat are 
identical or have similar (e.g. aliphatic, aromatic, polar, negatively charged, or positively 
charged) side-chain chemical groups. Identical and similar residues in homologous regions 
are sometimes described as being respectively "invariant" and "conserved" by those skilled 
in the art. 

20 In general, the method involves comparing the amino acid sequences of the BACE protein 
of Table 1 with a target BACE protein by aligning the amino acid sequences (Dunbrack et 
al.. Folding and Design^ 2, (1997), 27-42). Amino acids in the sequences are then 
compared and groups of amino acids that are homologous (conveniently referred to as 
"corresponding regions") are grouped together. This method detects conserved regions of 

25 the polypeptides and accounts for amino acid insertions or deletions. 

Homology between amino acid sequences can be determined using commercially available 
algorithms. The programs BLAST, gapped BLAST, BLASTN, PSI-BLAST and BLAST 2 
sequences (provided by the National Center for Biotechnology Information) are widely used 
in the art for this purpose, and can align homologoxis regions of two amino acid sequences. 
30 These may be used with default parameters to determine the degree of homology between 
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the amino acid sequence of the Table 1 protein and other target BACE proteins, which are 
to be modeled. 

Analogues are defined as proteins with similar three-dimensional structures and/or functions 
with little evidence of a common ancestor at a sequence level. 

5 Homologues are defined as proteins with evidence of a common ancestor, i-e. likely to be 
the result of evolutionary divergence and are divided into remote, mediimi and close sub- 
divisions based on the degree (usually expressed as a percentage) of sequence identity. 

A homologue is defined here as a protein with at least 15% sequence identity or which has 
at least one functional domain, which is characteristic of BACE. 

10 There are two types of homologue: orthologues and paralogues. Orthologues are defined as 
homologous genes in different organisms, i.e. the genes share a common ancestor 
coincident with the speciation event that generated fhem. Paralogues are defined as 
homologous genes in the same organism derived fi:om a gene/chromosome/ genome 
duplication, i.e. the common ancestor of the genes occurred since the last speciation event. 

15 The homologues could also be mutants as described in section (C). 

Once the amino acid sequences of the polypeptides with known and imknown structures are 
aligned, the structures of the conserved amino acids in a computer representation of the 
polypeptide v^th known structure are transferred to the corresponding amino acids of the 
polypeptide whose structure is unknown. For example, a tyrosine in the amino acid 
20 sequence of known structure may be replaced by a phenylalanine, the corresponding 
homologous amino acid in the amino acid sequence of unknown structure. 

The structures of amino acids located in non-conserved regions may be assigned manually 
by using standard peptide geometries or by molecular simulation techniques, such as 
molecular dynamics. The final step in the process is accomplished by refining the entire 
25 structure using molecular dynamics and/or energy minimization. 



Homology modelling as such is a technique that is well known to those skilled in the art 
(see e.g. Greer, Science, Vol. 228, (1985), 1055, and Blundell etal, Eur, J. Biochem, Vol. 
172, (1988), 513). The techniques described in these references, as well as other homology 
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modelling techniques, generally available in the art, may be used in performing the present 
invention. 

Thus the invention provides a method of homology modelling comprising the steps of: (a) 
aligning a representation of an anaino acid sequence of a target B ACE protein of unknown 

5 three-dimensional structure with the amino acid sequence of the BACE of Table 1 to match 
homologous regions of the amino acid sequences; (b) modelling the structure of the 
matched homologous regions of said target BACE of unknovm structure on the 
corresponding regions of the BACE structure as defined by Table 1; and (c) determining a 
conformation (e.g. so that favorable interactions are formed within the target BACE of 

10 unknown structure and/or so that a low energy conformation is formed) for said target 
BACE of unknown structure which substantially preserves the structure of said matched 
homologous regions. 

Preferably one or all of steps (a) to (c) are performed by computer modelling. 

The aspects of the invention described herein which utilise the BACE structure in silico 
15 may be equally applied to homologue models of BACE obtained by the above aspect of the 
invention, and this application forms a further aspect of the present invention. Thus having 
determined a conformation of a BACE by the method described above, such a conformation 
may be used in a computer-based method of rational drug design as described herein. 

The absence of a ligand from our structure is particularly advantageous for modelling of 
20 other proteins as this structure reveals the native structure of the protein unaffected by 
conformational changes upon ligand bindiag. 

F. Structure Solution 

The stmcture of the human BACE can also be used to solve the crystal structure of other 
target BACE proteins including other crystal forms of BACE, mutants, and co-complexes of 
25 BACE, where X-ray diffraction data or NMR spectroscopic data of these target BACE 
proteins has been generated and requires interpretation in order to provide a structure. 

In the case of BACE, this protein may crystallize in more than one crystal form. The 
structure coordinates of BACE, or portions thereof, as provided by this invention are 
particularly useful to solve the structure of those other crystal forms of BACE. They may 
30 also be used to solve the structure of BACE mutants, BACE co-complexes, or of the 



wo 2004/01 1641 PCT/GB2003/003200 

37 

crystalline form of any other protein with significant amino acid sequence homology to any 
functional domain of BACE. 

In the case of other target BACE proteins, particularly the BACE proteins referred to in 
Section C above, the present invention allows the structures of such targets to be obtained 
5 more readily where raw X-ray diffraction data is generated. 

Thus, where X-ray crystallographic or NMR spectroscopic data is provided for target 
BACE-ligand complex, or a BACE homologue or analogue of unknown three-dimensional 
structure, the structure of BACE, as defined by Table 1, may be used to interpret that data to 
provide a likely structure for the other BACE by techniques which are well known in the 
10 art, e.g. phasing in the case of X-ray crystallography and assisting peak assignments in 
NMR spectra. 

One method that may be employed for these purposes is molecular replacement. Jn this 
method, the unknown crystal structure, whether it is another crystal form of BACE, a BACE 
mutant, or a BACE co-complex, or the crystal of a target BACE protein with amino acid 
15 sequence homology to any fimctional domain of BACE, may be determined using the 

BACE structure coordinates of this invention as provided herein. This method will provide 
an accurate structural form for the unknown crystal more quickly and efficiently than 
attempting to determine such information ab initio. 

Examples of computer programs known in the art for performing molecular replacement are 
20 CNX (Brunger A.T.; Adams P.D.; Rice L.M., Current Opinion in Structural Biology, 
Volume 8, Issue 5, October 1998, Pages 606-611 (also commercially available fiom 
Accelerys San Diego, CA) or AMORE (Navaza, J. (1994). AMoRe: an automated package 
for molecular replacement. Acta Cryst A50, 157-163). 

Thus, in a fiirther aspect of the invention provides a method for determining the structure of 
25 a protein, which method comprises; providing the co-ordinates of Table 1 , and either (a) 
positioning the co-ordinates in the crystal unit cell of said protein so as to provide a 
structure for said protein or (b) assigning NMR spectra Peaks of said protein by 
manipulating the coordinates of Table 1 . 
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In a preferred aspect of this invention the co-ordinates are used to solve the structure of 
target BACE particularly homologues of B ACE for example aspartic proteases such as 
BACE2 or cathepsin E (69% and 37% similarity, respectively). 

G. Computer Svstems 

5 In another aspect, the present invention provides systems, particularly a computer system, 
the systems containing either (a) atomic coordinate data according to Table 1, said data 
defining the three-dimensional structure of BACE or at least selected coordinates thereof; 
(b) structure factor data (where a structure factor comprises the amplitude and phase of the 
dif&acted wave) for BACE, said structure factor data being derivable from the atomic 

10 coordinate data of Table 1; (c) atomic coordinate data of a target BACE protein generated 
by homology modelling of the target based on the data of Table 1; (d) atomic coordinate 
data of a target BACE protein generated by interpreting X-ray crystallographic data or 
NMR data by reference to the data of Table 1; or (e) structure factor data derivable firom the 
atomic coordinate data of (c) or (d), 

15 For example the computer system may comprise: (i) a computer-readable data storage 

medium comprising data storage material encoded with the computer-readable data; (ii) a 
working memory for storing instructions for processing said computer-readable data; and 
(iii) a central-processing unit coupled to said working memory and to said computer- 
readable data storage medium for processing said computer-readable data and thereby 

20 generating structures and/or performing rational drug design. The computer system may 
further comprise a display coupled to said central-processing unit for displaying said 
structures. 

The invention also provides such systems containing atomic coordinate data of target BACE 
proteins wherein such data has been generated according to the methods of the invention 
25 described herein based on the starting data provided by Table 1 . 

Such data is useful for a number of purposes, including the generation of structures to 
analjrze the mechanisms of action of BACE proteins and/or to perform rational drug design 
of compounds which interact with BACE, such as compoimds which are inhibitors of 
BACE. 
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In another aspect, the invention provides a computer-readable storage medium, comprising 
a data storage material encoded with computer readable data, wherein the data are defined 
by all or a portion (e.g. selected coordinates as defined herein) of the structure coordinates 
of B ACE of Table 1, or a homologue of B ACE, wherein said homologue comprises 
5 backbone atoms that have a root mean square deviation fi-om the Ca or backbone atoms 
(nitrogen-carbona-carbon) of Table 1 of less than 2 A, such as not more than 1.5 A, 
preferably less than L5 A, more preferably less than 1.0 A, even more preferably less than 
0.74 A, even more preferably less than 0.72 A and most preferably less than 0,5 A. 

The invention also provides a computer-readable data storage medium comprising a data 
10 storage material encoded with a first set of computer-readable data comprising a Fourier 
transform of at least a portion (e.g. selected coordinates as defined herein) of the structural 
coordinates for BACE according to Table 1; which, when combined with a second set of 
machine readable data comprising an X-ray diffraction pattern of a molecule or molecular 
complex of unknown structure, using a machine programmed with the instructions for using 
15 said first set of data and said second set of data, can determine at least a portion of the 
stmcture coordinates corresponding to the second set of machine readable data. 

In a further aspect, the present invention provides computer readable media with with at 
least one of: (a) atomic coordinate data according to Table 1 recorded thereon, said data 
defining the three-dimensional structure of BACE, or at least selected coordinates thereof; 

20 (b) structure factor data for BACE recorded thereon, the structure factor data being 

derivable fi:om the atomic coordinate data of Table 1; (c) atomic coordinate data of a target 
BACE protein generated by homology modelling of the target based on the data of Table 1; 
(d) atomic coordinate data of a BACE-ligand complex or a BACE homologue or analogue 
generated by interpreting X-ray crystallographic data or NMR data by reference to the data 

25 of Table 1 ; and (e) structure factor data derivable firom the atomic coordinate data of (c) or 
(d). 

By providing such computer readable media, the atomic coordinate data can be routinely 
accessed to model BACE or selected coordinates thereof. For example, RASMOL (Sayle et 
al., TIBS, Vol. 20, (1995), 374) is a publicly available computer software package which 
30 allows access and analysis of atomic coordinate data for structure determination and/or 
rational drug design. 
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On the other hand, structure factor data, which are derivable firom atomic coordinate data 
(see e.g. Blundell et al., in Protein Crystallography^ Academic Press, New York, London 
and San Francisco, (1976)), are particularly useful for calculating e.g. difference Fourier 
electron density maps. 

A further aspect of the invention provides a method of providing data for generating 
structures and/or performing rational drug design for BACE, B ACE homologues or 
analogues, complexes of B ACE with a potential modidator, or complexes of B ACE 
homologues or analogues with potential modulators, the method comprising: 

(i) establishing commimication with a remote device containing computer-readable data 
comprising at least one of: (a) atomic coordinate data according to Table 1, said data 
defining the three-dimensional structure of B ACE, at least one sub-domain of the three- 
dimensional structure of BACE, or the coordinates of a plurality of atoms of BACE; (b) 
structure factor data for BACE, said structure factor data being derivable fi:om the atomic 
coordinate data of Table 1; (c) atomic coordinate data of a target BACE homologue or 
analogue generated by homology modelling of the target based on the data of Table 1; (d) 
atomic coordinate data of a protein generated by interpreting X-ray crystallographic data or 
NMR data by reference to the data of Table 1 ; and (e) structure factor data derivable firom 
the atomic coordinate data of (c) or (d); and (ii) receiving said computer-readable data firom 
said remote device. 

Thus the remote device may comprise e.g. a computer system or computer readable media 
of one of the previous aspects of the invention. The device may be in a different coimtry or 
jurisdiction firom where the computer-readable data is received. The communication may 
be via the internet, intranet, e-mail etc. Typically the communication will be electronic in 
nature, but some or all of the commimication pathway may be optical, for example, over 
optical fibres. Additionally, the communication may be through radio signals or satellite 
transmissions. 

H. Uses of the Crvstals of the Invention 

The crystal structures obtained according to the present invention (including the structure of 
Table 1 as well the structures of target BACE proteins obtained in accordance with the 
methods described herein), may be used in several ways for drug design. 
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By identifying conditions under which high quality crystals of apo-BACE can be produced 
(i.e. crystals which can diffract X-rays for the determination of atomic coordinates to a 
resolution of better than 2.5 A), the present invention facilitates the identification of 
modulators of B ACE activity. 

5 The invention is particularly suitable for the design, screening, development and 

optimization of B ACE inhibitor components. It is thus a preferred aspect of the invention 
that modulators are inhibitors. 

In a further aspect, the invention provides a method for determining the structure of a 
compound bound to BACE, said method comprising: (a) providing a crystal of BACE 
10 according to the invention; (b) soaking the crystal with said compounds; and (c) 

determining the structure of said BACE compound complex by employing the data of Table 
1. 

Alternatively, the BACE and compound may be co-crystallized. Thus the invention 
provides a method for determining the structure of a compound boimd to BACE, said 
15 method comprising; mixing the protein with the compound(s), crystallizing the protein- 
compound(s) complex; and determining the structure of said BACE-compound(s) complex 
by reference to the data of Table 1. 

A mixture of compounds may be soaked or co-crystallized with the crystal, wherein only 
one or some of the compoimds may be expected to bind to the BACE. As well as the 
20 structure of the complex, the identity of the complexing compound(s) is/are then 
determined. 

In either ceise, substrate or a substrate analogue thereof may optionally be present. 

The method may comprise the further steps of: (a) obtaining or synthesising said candidate 
modulator; (b) forming a complex of BACE and said candidate modulator; and (c) 
25 analysing said complex by X-ray crystallogrs^hy or NMR spectroscopy to determine the 
ability of said candidate modulator to interact with BACE. 

The analysis of such structures may employ (i) X-ray crystallographic diffiaction data from 
the complex and (ii) a three-dimensional structure of BACE, or at least selected coordinates 
thereof, to generate a difference Fourier electron density map of the complex, the three- 
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dimensional structure being defined by atomic coordinate data according to Table 1. The 
difference Fourier electron density map may then be analyzed, to identify the binding mode 
of the modulator. 

Therefore, such complexes can be crystallized and analyzed using X-ray dif&action 
5 methods, e.g. according to the approach described by Greer et al., J. of Medicinal 

Chemistry^ Vol. 37, (1994), 1035-1054, and difference Fourier electron density maps can be 
calculated based on X-ray diffraction patterns of soaked crystals of B ACE or co-crystallized 
BACE and the solved stmcture of uncomplexed BACE. These maps can then be analyzed 
e.g. to determine whether and where a particvdar compound binds to BACE and/or changes 
1 0 the conformation of BACE. 

Electron density maps can be calculated using programs such as those from the CCP4 
computing package (Collaborative Computational Project 4. The CCP4 Suite: Programs for 
Protein Crystallography, Acta Crystallographica, D50, (1994), 760-763.). For map 
visualization and model building programs such as "O" (Jones et al,. Acta 
15 Crystallographica, A47, (1991), 1 10-119) or "QUANTA" (1994, San Diego, CA: 
Molecular Simulations can be used. 

The crystal structures of a series of complexes may then be solved by molecular 
replacement and compared with that of the BACE of Table 1. Potential sites for 
modification within the various binding sites of the enzyme may thus be identified. This 
20 information provides an additional tool for determining the most eflScient binding 
interactions, for example, increased hydrophobic interactions, between BACE and a 
chemical entity or compound. 

All of the complexes referred to above may be studied using well-known X-ray diffraction 
techniques and may be refined against 1 .5 to 3.5 A resolution X-ray data to an R value of 
25 about 0.30 or less using computer software, such as CNX (Brunger et al.. Current Opinion 
in Structural Biology, Vol. 8, Issue 5, October 1998, 606-61 1, and commercially available 
firom Accelerys, San Diego, CA), X-PLOR (Yale University, ©1992, distributed by 
Accelerys), as described by Blundell et al, (1976) and Methods in Enzymology, vol. 1 14 & 
1 15, H. W. Wyckoff et al,, eds.. Academic Press (1985). 
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This information may thus be vised to optimize known classes of B ACE substrates or 
inhibitors, and more importantly, to design and synthesize novel classes of B ACE 
inhibitors. 

Analysing the complex by X-ray crystallography will determine the ability of the candidate 
5 compound to interact with BACE. Analysis of the co-complexes of BACE may involve e.g. 
phasing, molecular replacement or calculating a Fourier difference map of the complex as 
discussed above. However, with the high resolutions obtainable with the crystal, it can also 
be possible to determine the ability of the candidate modulator to interact with BACE 
merely by comparing the intensities and/or positions of X-ray diffiraction spots from the 
10 complex with e.g. diffraction spots of xmcomplexed BACE or a previously identified 

BACE-ligand complex. Thus the step of analysing the complex may involve analysing tfie 
intensities and/or positions of X-ray diffraction spots from the complex to determine the 
ability of the candidate modulator to interact with BACE. 

Having obtained and characterized a modulator compound according to the invention, the 
15 invention further provides a method for modulating the activity of BACE which method 
comprises: (a) providing BACE under conditions where, in the absence of modulator, the 
BACE is able to synthesize amyloid P-peptide from amyloid precursor protein (APP); (b) 
providing a modulator compound; and (c) determining the extent to which the activity of 
BACE is altered by the presence of said compoimd. 

20 I. Structure-based Drug Design 

Determination of the three-dimensional structure of BACE provides important information 
about the binding sites of BACE, particularly when comparisons are made with similar 
enzymes. This information may then be used for rational design of BACE inhibitors, e.g. 
by computational techniques which identify possible binding ligands for the binding sites, 
25 by enabling linked-fragment approaches to drug design, and by enabling the identification 
and location of bound ligands using X-ray crystallographic analysis. These techniques are 
discussed in more detail below. 

Greer et al (1994) describes an iterative approach to ligand design based on repeated 
sequences of computer modelling, protein-ligand complex formation and X-ray 
30 crystallographic or NMR spectroscopic analysis. Thus novel thymidylate synthase inhibitor 
series were designed de novo by Greer et aL, and BACE inhibitors may also be designed in 
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the this way. More specifically, using e.g. GRID on the solved 3D structure of BACE, a 
ligand (e.g. a potential inhibitor) for BACE may be designed that complements the 
functionalities of the BACE binding sites. The ligand can then be synthesised, formed into 
a complex with BACE, and the complex then analysed by X-ray crystallography to identify 
5 the actual position of the bound ligand. The structure and/or functional groups of the ligand 
can then be adjusted, if necessary, in view of the results of the X-ray analysis, and the 
synthesis and analysis sequence repeated until an optimised ligand is obtained. Related 
approaches to structure-based drug design are also discussed in Bohacek et al.^ Medicinal 
Research Reviews, Vol.16, (1996), 3-50. 

10 Linked-fragment approaches to drug design also require accurate information on the atomic 
coordinates of target receptors. The basic idea behind these approaches is to determine 
(computationally or experimentally) the binding locations of plural ligands to a target 
molecule, and then construct a molecular scaffold to connect the ligands together in such a 
way that their relative binding positions are preserved. The ligands may be provided 

15 computationally and modelled in a computer system, or provided in an experimental setting, 
wherein crystals according to the invention are provided and a plurality of ligands soaked 
separately or in mixed pools into the crystal prior to X-ray analysis and determination of 
their location. 

The binding site of two or more ligands are determined and may be connected to form a 
20 potential lead compoxmd that can be further refined using e.g. the iterative technique of 
Greer et al. For a virtual linked-fragment approach see Verlinde et al., J, of Computer- 
Aided Molecular Design^ 6, (1992), 13 1-147, and for NMR and X-ray approaches see 
Shuker et al., Science, 274, (1996), 1531-1534 and Stout et al.. Structure, 6, (1998), 839- 
848. The use of these approaches to design BACE inhibitors is made possible by the 
25 determination of the BACE structure. 

Many of the techniques and approaches to structure-based drug design described above rely 
at some stage on X-ray analysis to identify the binding position of a ligand in a ligand- 
protein complex. A common way of doing this is to perform X-ray crystallography on the 
complex, produce a difference Fourier electron density map, and associate a particular 
30 pattem of electron density with the ligand. However, in order to produce the map (as 

explained e.g. by Blundell et al (1976)) it is necessary to know beforehand the protein 3D 
structure (or at least the protein structure factors). Therefore, determination of the BACE 
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Structure also allows difference Fourier electron density maps of BACE-ligand complexes 
to be produced, which can greatly assist the process of rational drug design. 

The provision of the crystal structures of the invention will also allow the development of 
compounds which interact with the binding pocket regions of B ACE (for example to act as 
5 inhibitors of a BACK) based on a fragment linking or fragment growing approach. 

For example, the binding of one or more molecular fragments can be determined in the 
protein binding pocket by X-fay crystallography. Molecular fragments are typically 
compounds with a molecxdar weight between 100 and 200 Da (Carr et al, 2002). This can 
then provide a starting point for medicinal chemistry to optimize the interactions using a 

10 stmcture-based approach. The fragments can be combined onto a template or used as the 
starting point for * growing out' an inhibitor into other pockets of the protein (Blimdell et al, 
2002). The fragments can be positioned in the binding pocket of B ACE and then 'grown' to 
jfill the space available, exploring the electrostatic, van der Waals or hydrogen-bonding 
interactions that are involved in molecular recognition. The potency of the original weakly 

15 binding fragment thus can be rapidly improved using iterative structure-based chemical 
synthesis. 

At one or more stages in the fragment growing approach, the compoxmd may be synthesized 
and tested in a biological system for its activity. This can be used to guide the ftirther 
growing out of the fragment. 

20 Where two fragment-binding regions are identified, a linked fragment approach may be 

based upon attempting to link the two fragments directly, or growing one or both fragments 
in the manner described above in order to obtain a larger, linked structure, which may have 
the desired properties. 

The previous aspects of the invention relate also to fragment linking or fragment growing 
25 approaches to rational drug design. Thus the step of providing the structure of a candidate 
modulator molecule in the previous aspects may be performed by providing the structures of 
a plurality of molecular fragments and linking the molecular fragments to form a candidate 
modulator molecule. Furthermore the step of fitting the structure of the candidate 
modulator molecule in the previous aspects may be performed by fitting the structure of 
30 each of the molecular fragments (before or after the molecular fragments are linked 
together). 
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For example, the computer-based method of rational drug design may comprise: 

(a) providing the coordinates of at least two atoms of the BACE of Table 1 ; (b) providing 
the structures of a plurality of molecular fragments; (c) fitting the structure of each of the 
molecular fragments to the selected coordinates of the BACE; and (d) assembling the 
5 molecular fragments into a single molecule to form a candidate modulator molecule. 

In practice, it will be desirable to model a sufficient number of atoms of the BACE as 
defined by the coordinates of Table 1 , which represent a binding pocket Thus, in this 
embodiment of the invention, there will preferably be provided flie coordinates of at least 5, 
preferably at least 10, more preferably at least 50 and even more preferably at least 100 
10 preferably at least 500 selected atoms of the BACE structure. 

A further aspect of the invention provides a compound having a chemical structure selected 
using the method of any one of the previous aspects, said compoxmd being an inhibitor of 
BACE. 

J. Uses of the Coordinates of the Invention in in silico analysis and design 

15 Although the invention will facilitate the determination of actual crystal structures 
comprising BACE and a compound, which modulates BACE, current computational 
techniques provide a powerful alternative to the need to generate such crystals and generate 
and analyze diffraction data. Accordingly, a particularly preferred aspect of the invention 
relates to in silico methods directed to the analysis and development of compounds, which 

20 interact, with BACE structures of the present invention. 

The approaches to structure-based drug design described below all require initial 
identification of possible compounds for interaction with target bio-molecule (in this case 
BACE). Sometimes these compounds are known e.g. from the research literature. 
However, when they are not, or when novel compounds are wanted, a first stage of the drug 

25 design program may involve computer-based in silico screening of compound databases 
(such as the Cambridge Structural Database) with the aim of identifying compounds which 
interact with the binding site or sites of the target bio-molecule. Screening selection criteria 
may be based on pharmacokinetic properties such as metabolic stability and toxicity. 
However, determination of the BACE stmcture allows the architecture and chemical nature 

30 of each BACE binding site to be identified, which in turn allows the geometric and 
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functional constraints of a descriptor for the potential inhibitor to be derived. The descriptor 
is, therefore, a type of virtual 3-D pharmacophore, which can also be used as selection 
criteria or jQlter for database screening. 

Thus as a result of the determination of the BACE three-dimensional structure, more purely 
5 computational techniques for rational drug design may also be used to design BACE 
inhibitors (for an overview of these techniques see e.g. Walters et al (Drug Discovery 
Today, Vol.3, No.4, (1998), 160-178; Abagyan, R.; Totrov, M. Curr. Opiru Chem. Biol 
2001, 5, 375-382). For example, automated ligand-receptor docking programs (discussed 
e.g. by Jones et al. in Current Opinion in Biotechnology, Vol.6, (1995), 652-656 and 
10 Halperin, L; Ma, B.; Wolfson, H.; Nussinov, R. Proteins 2002, 47, 409-443), which require 
accurate information on the atomic coordinates of target receptors may be used to design 
potential BACE inhibitors. 

The aspects of the invention described herein which utilize the BACE structure in silico 
may be equally applied to both the BACE structure of Table 1 and the models of target 
15 BACE proteins obtained by other aspects of the invention. Thus having determined a 

conformation of a BACE by the method described above, such a conformation may be used 
in a computer-based method of mtional drug design as described herein. In addition the 
availabilily of the structure of the BACE will allow the generation of highly predictive 
pharmacophore models for virtual library screening or compound design. 

20 Accordingly, the invention provides a computer-based method for the analysis of the 
interaction of a molecular structure with a BACE structure of the invention, which 
comprises: (a) providing the structure of a BACE of the invention of Table 1; (b) providing 
a molecular structure to be fitted to said BACE structure; and (c) fitting the molecxilar 
structure to the BACE structure of Table 1 . 

25 In an alternative aspect, the method of the invention may utilize the coordinates of atoms of 
interest of BACE, which are in the vicinity of a putative molecular structure binding region, 
for example within 10-25 A of the catalytic regions or within 5-10 A of a compoimd bound, 
in order to model the pocket in which the structure binds. These coordinates may be used to 
define a space, which is then analyzed "m silico". Thus the invention provides a computer- 

30 based method for the analysis of molecular structures which comprises: (a) providing the 
coordinates of at least two atoms of a BACE structure of the invention ("selected 
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coordinates"); (b) providing the structure of a molecular structure to be fitted to said 
coordinates; and (c) fitting the structure to the selected coordinates of the BACE. 

In practice, it will be desirable to model a sufficient number of atoms of the BACE as 
defined by the coordinates of Table 1, which represent a binding pocket. Thus, in this 
5 embodiment of the invention, there will preferably be provided the coordinates of at least 5, 
preferably at least 10, more preferably at least 50 and even more preferably at least 100 and 
preferably 500 selected atoms of the BACE structure. 

In order to provide a three-dimensional structure of compoxmds to be fitted to a BACE 
structure of the invention, the compound structure may be modelled in three dimensions 
10 using commercially available software for this purpose or, if its crystal structure is 

available, the coordinates of the structure may be used to provide a representation of the 
compound for fitting to a BACE structure of the invention. 

The step of providing the structure of a candidate modulator molecule may involve selecting 
the compound by computationally screening a database of compounds for interaction with 

15 the binding cavity or cavities. For example, a 3-D descriptor for the potential modulator 
may be derived, the descriptor including geometric and fimctional constraints derived &om. 
the architecture and chemical nature of the binding cavity or cavities. The descriptor may 
then be used to interrogate the compound database, a potential modulator being a compound 
that has a good match to the features of the descriptor. In effect, the descriptor is a type of 

20 virtual pharmacophore. 

In any event, the determination of the three-dimensional stmcture of BACE provides a basis 
for the design of new and specific ligands for BACE. For example, knowing the three- 
dimensional stracture of BACE, computer modelling programs may be used to design 
different molecules expected to interact with possible or confirmed binding cavities or other 
25 structural or functional features of BACE. Examples of this are discussed in Schneider, G.; 
Bohm, H. J. Drug Discov.Today 2002, 7, 64-70. 

More specifically, the interaction of a compound with BACE can be examined through the 
use of computer modelling using a docking program such as GOLD (Jones et al., J, Mol 
Biol, 245, 43-53 (1995), Jones et al., J, Mol Biol, 267, lll-lAi (1997)), GRAMM 
30 (Vakser, LA., Proteins , SuppL, 1 :226-230 (1997)), DOCK (Kuntz et al, JMolBiol 1982 , 
161, 269-288, Makino et al, XComputChem. 1997, 18, 1812-1825), AUTODOCK 
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(Goodsell et al. Proteins 1990, 8, 195-202, Morris et al, J,ComputChem, 1998, 19, 1639- 
1662.), FlexX, (Rarey et al, JMolBiol 1996, 261, 470-489) or ICM (Abagyan et al, 
J.Comput.Chem. 1994, 75, 488-506). This procedure can include computer fitting of 
compounds to BACE to ascertain how well the shape and the chemical structure of the 
5 compound will bind to the BACE. 

Also computer-assisted, manual examination of the binding site structure of BACE may be 
performed. The use of programs such as GRID (Goodford, J, Med Chem,, 28, (1985), 849- 
857) - a program that determines probable interaction sites between molecules with various 
functional groups and an enzyme surface - may also be used to analyse the binding cavity or 
10 cavities to predict partial structures of inhibiting compounds. 

Computer programs can be employed to estimate the attraction, repulsion, and steric 
hindrance of the two binding partners (i.e. the BACE and a candidiate modulator). 
Generally the tighter the fit, the fewer the steric hindrances, and the greater the attractive 
forces, the more potent the potential modulator since these properties are consistent with a 
15 tighter binding constant. Furthermore, the more specificity in the design of a potential dmg, 
the more likely it is that the drug will not interact with other proteins as well. This will tend 
to miiiimise potential side-effects due to unwanted interactions with other proteins. 

In another aspect, the present invention provides a method for identifying an agent 
compound (e.g. an inhibitor) which modulates BACE activity, comprising the steps of: (a) 
20 employing three-dimensional atomic coordmate data according to Table 1 to characterise at 
least one BACE binding site and preferably a plurality of BACE binding sites; (b) providing 
the structure of a candidate agent compound; (c) fitting the candidate agent compound to the 
binding sites; and (d) selecting the candidate agent compound. 

Preferably sufficient binding sites are characterised to define a BACE binding cavity or 
25 cavities. 

A plurality (for example two, three or four) of (typically spaced) BACE binding sites may 
be characterised and a plurality of respective compounds designed or selected. The agent 
compound may then be fomied by linking the respective compounds into a larger compound 
which preferably maintains the relative positions and orientations of the respective 
30 compounds at the binding sites. The larger compound may be formed as a real molecule or 
by computer modelling. 
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In one embodiment a plurality of candidate agent compounds are screened or interrogated 
for interaction with the binding sites. In one example, step (b) involves providing the 
structures of the candidate agent compounds, each of which is then fitted in step (c) to 
computationally screen a database of compounds (such as the Cambridge Structural 
Database) for interaction with the binding sites, i.e. the candidate agent compound may be 
selected by computationally screening a database of compounds for interaction with the 
binding sites (see Martin, J. Med. Chem,^ vol 35, 2145-2154 (1992)). In another example, a 
3-D descriptor for the agent compound is derived, the descriptor including e.g. geometric 
and functional constraints derived firom the architecture and chemical nature of the binding 
cavity or cavities. The descriptor may then be used to interrogate the compound databeise, 
the identified agent compound being the compound which matches with the features of the 
descriptor. In effect, the descriptor is a type of virtual pharmacophore. 

In a related aspect, the present invention provides a method for identifying a candidate 
modulator (e.g. potential inhibitor) of B ACE comprising the steps of: (a) employing a three- 
15 dimensional structure of BACE, at least one sub-domain thereof, or a plurality of atoms 
thereof, to characterise at least one BACE binding cavity, the three-dimensional structure 
being defined by atomic coordinate data according to Table 1; and (b) identifying the 
candidate modulator by designing or selecting a compound for interaction with the binding 
cavity. 

20 Detailed structural information can then be obtained about the binding of the compound to 
BACE, and in the light of this information adjustments can be made to the structure or 
functionality of the compound, e.g. to improve its interaction with BACE. The above steps 
may be repeated and re-repeated as necessary. 

K. Compound selection 

25 In another aspect, in place of in silico methods, high throughput screening of compounds to 
select compounds with binding activity may be undertaken, and those compounds which 
show binding activity may be selected as possible candidate modulators, and further 
crystallized with BACE (e.g. by co-crystallization or by soaking) for X-ray analysis. The 
resulting X-ray structure may be compared with that of Table 1 for a variety of purposes- 
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L, Compounds of the invention 

Having designed or selected possible binding candidate modiolators (e.g. by in silico 
analysis, "wet" chenxical methods. X-ray analysis etc.) by determining those which have 
favourable fitting properties (e.g. strong attraction between candidate and BACE), these can 
5 then be screened for activity. 

Consequently all the methods of compound design and identification outlined above can 
optionally include the step of: (a) obtaining or synthesising the candidate modulator; and 
(b) contacting the candidate modulator with BACE to determine the ability of the candidate 
modulator to interact with BACE. 

10 More preferably, in the latter step the candidate modulator is contacted with BACE xmder 
conditions to determine its fimction. 

For example, in the contacting step above the candidate modulator is contacted with BACE 
in the presence of a substrate, and typically a buffer, to detemiine the ability of said 
candidate modulator to inhibit BACE. The substrate may be e.g. APP. So, for example, an 
15 assay mixture for BACE may be produced which comprises the candidate modulator, 
substrate and bufTer. 

Detailed structural information can be obtained about the binding of the candidate 
modulator to BACE, and in the light of this information adjustments can be made to the 
structure or fiinctionality of the candidate modulator, e.g. to improve binding to the binding 
20 cavity or cavities. The above steps may be repeated and re-repeated as necessary. 

Following identification of such compounds, it may be manufactured and/or used in the 
preparation, i.e. manufacture or formulation, of a composition such as a medicament, 
pharmaceutical composition or drug. These may be administered to individuals. 

Thus, the present invention extends in various aspects not only to a compound as provided 
25 by the invention, but also a pharmaceutical composition, medicament, dmg or other 
composition comprising such a compound e.g. for treatment (which may include 
preventative treatment) of disease; a method comprising administration of such a 
composition to a patient, e.g. for treatment of disease; use of such an inhibitor in thie 
manufacture of a composition for administration, e.g. for treatment of disease; and a method 
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of making a pharmaceutical composition comprising admixing such an inhibitor with a 
pharmaceutically acceptable excipient, vehicle or carrier, and optionally other ingredients. 

Thus a further aspect of the present invention provides a method for preparing a 
medicament, pharmaceutical composition or drug, the method comprising: 

5 (a) identifying a BACE modulator molecule (which may thus be termed a lead compoimd) 
by a method of any one of the other aspects of the invention disclosed herein; (b) optimising 
the structure of the modulator molecule; and (c) preparing a medicament, pharmaceutical 
composition or drug containing the optimised modulator molecule. 

The above-described processes of the invention may be iterated in that the modified 
10 compoxmd may itself be the basis for further compound design. 

By "optimising the structure" we mean e.g. adding molecular scaffolding, adding or varying 
functional groups, or coimecting the molecule with other molecules (e.g. using a firagment 
linking approach) such that the chemical structure of the modulator molecule is changed 
while its original modulating functionality is maintained or enhanced. Such optimisation is 
15 regularly undertaken during drug development programmes to e.g. enhance potency, 

promote pharmacological acceptability, increase chemical stability etc. of lead compounds. 

Modification will be those conventional in the art known to the skilled medicinal chemist, 
and will include, for example, substitutions or removal of groups containing residues which 
interact with the amino acid side chain groups of a BACE structure of the invention. For 

20 example, the replacements may include the addition or removal of groups in order to 

decrease or increase the charge of a group in a test compound, the replacement of a charge 
group with a group of the opposite charge, or the replacement of a hydrophobic group with 
a hydrophilic group or vice versa. It will be imderstood that these are only examples of the 
type of substitutions considered by medicinal chemists in the development of new 

25 pharmaceutical compounds and other modifications may be made, depending upon the 
nature of the starting compound and its activity. 

Compositions may be formulated for any suitable route and means of administration. 
Pharmaceutically acceptable carriers or diluents include those used in formulations suitable 
for oral, rectal, nasal, topical (including buccal and sublingual), vaginal or parenteral 
30 (including subcutaneous, intramuscular, intravenous, intradermal, intrathecal and epidural) 
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administration. The formulations may conveniently be presented in unit dosage form and 
may be prepared by any of the methods well known in the art of pharmacy. 

For solid compositions, conventional non-toxic solid carriers include, for example, 
pharmaceutical grades of mannitol, lactose, cellulose, cellulose derivatives, starch, 
magnesium stearate, sodium saccharin, talcum, glucose, sucrose, magnesium carbonate, and 
the like may be used. Liquid pharmaceutically administrable compositions can, for 
example, be prepared by dissolving, dispersing, etc, an active compoimd as defined above 
and optional pharmaceutical adjuvants in a carrier, such as, for example, water, saline 
aqueous dextrose, glycerol, ethanol, and the like, to thereby form a solution or suspension. 
If desired, the pharmaceutical composition to be administered may also contain minor 
amoimts of non-toxic auxiliary substances such as wetting or emulsifying agents, pH 
buffering agents and the like, for example, sodium acetate, sorbitan monolaurate, 
triethanolamine sodium acetate, sorbitan monolaurate, triethanolamine oleate, etc. Actual 
methods of preparing such dosage forms are known, or will be apparent, to those skilled in 
this art; for example, see Remington's Pharmaceutical Sciences, Mack Publishing 
Company, Easton, Pennsylvania, 15th Edition, 1975. 

Compositions may be used, e.g. for treatment (which may include preventative treatment) of 
a disease such as Alzheimer's disease or Alzheimer' s-type pathology in Downs syndrome. 
Thus the invention provides a method comprising administration of such a composition to a 
patient, e.g. for treatment of a disease such as Alzheimer's disease; use of such an agent 
compound in the manufacture of a composition for administration, e.g. for treatment of a 
disease such as Alzheimer's disease; and a method of making a pharmaceutical composition 
comprising admixing such an agent compound with a pharmaceutically acceptable 
excipient, vehicle or carrier, and optionally other ingredients. 

Exemplification 

The invention will now be described with reference to specific Examples. These are merely 
exemplary and for illustrative purposes only; they are not intended to be limiting in any 
way to the scope of the invention described. These examples constitute the best mode 
currently contemplated for practicing the invention. 
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BACE protease was expressed at high levels m bacterial cells as insoluble inclusion bodies. 
To prepare functional protein for enzyme assay and structural studies these inclusion bodies 
were solublised using denaturants; the slow removal of these denaturants allowed the 
formation of the correct tertiary structure. In the method described here, BACE was 
5 expressed as a pro-sequence and required activation by a protease before becoming fully 
functional. Clostripain was used as an activating protease but produced multiple species of 
BACE as determined by mass spectrometry. In order to obtain a uniform homogenous 
protein after activation by clostripain, a number of different constructs were produced. 
These constructs focused on the mutation of two undesireable clostripain cleavage sites 
10 (following residues R56 and R57). 

Cloning of BACE WT and BACE N->0 

The full-length DNA coding sequence of BACE was cloned from human cerebellum and 
human dorsal root ganglion (DRG) cDNA by PCR using oligonucleotide primers based on 
the published BACE sequence (EMBL accession no. AF 190725). The full-length template 
15 sequence was obtained by PCR amplification using the following primers: hBACE-spl and 
-apl were used for primary amplification, hBACE-sp2 and -ap2 for nested PCR. ' 

The primers were as follows: 

hBACE-spl 5'-AGCTCCCTCTCCTGAGAAGCCACC-3' (SEQ ID NO:22) 

hBACE-apl 5'-CCACAGGTGCCATCTGTGTCTCC-3* (SEQ ID NO:23) 

20 hBACE-sp2 5'-CACCAGCACCACCCAGACTTGG-3* (SEQ ID NO:24) 

hBACE-ap2 5'-AACCACGGAGGTGTGGTCCAGG-3' (SEQ ID NO:25) 

A cDNA construct encoding a modified BACE form was made as follows. A partial BACE 
cDNA fragment was amplified using the full-length BACE clone as a template with primers 
hBACE_EC(Bam-M-14)_FOR (5' - CGG GAT CCA TGG CGG GAG TGC TGC CTG CC 
25 - 3*) and hBACE_EC(Bam-453)_REV (5' - CGG GAT CCT TAT GAC TCA TCT GTC 
TGT GGA ATG TTG TAG C - T). The resultmg 1342 bp PCR fragment was subcloned in 
vector pCR2. 1 -TOPO using the TOPO TA cloning® kit (Invitrogen) according to the 
manufacturer's instructions. The inserts of several resulting clones were fully sequenced 
and a clone containing no PCR mistakes was selected. The insert of this clone was excised 
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from the pCR2. 1 -TOPO construct using the BamHI restriction endonuclease and subcloned 
to vector pETl la (Novagen) linearized with BamWi. The BACE coding sequence (BACE 
WT, SEQ ED 1) in the resulting clones was confirmed by sequence analysis and the 
resulting correct construct was named M-T7-RGSM(BACE14-453)/pETl la. 

5 Plasmid M-T7-RGSM(BACE14-453)/pETl la encodes a 455 amino acid residue protein 
named BACE WT containing a T7 epitope tag encoded by the pBTl la vector sequence (AA 
1 to 1 1), a linker sequence (AA 12-15; RGSM) and the partial BACE amino acid sequence 
from residue 14 to 453 (AA 16 to 455)(numbermg based on SEQ ID 2). The calculated 
molecular mass of the resulting protein is 50,2 kDa. 

10 The insert from construct Plasmid M-T7-RGSM(BACE14-453)/pETl la was amplified by 
PCR to incorporate a His6 tag (CAT CAC CAT CAT CAC CAC) just upstream of the stop 
codon and BamHI site. Following cloning of this amplified fragment back into the original 
expression vector, the asparagine residues at positions -153, -172, -223 and -354 (mimbers 
refer to the database BACE sequence BACE_HUMAN, P56817 in Swissprot) were mutated 

15 to glutamine (AAC to CAA) using the Quikchange™ mutagenesis system (Stratagene, used 
according to the manufacturers instructions), to generate BACE N->Q (SEQ ID 3). 

Introduction of Activation Site Mutations 

BACE WT and BACE N->Q, described above, were mutated using the Quickchange"^" site 
directed mutagenesis protocol (Stratagene). Two complimentary oligonucleotides were 
20 designed which spanned the site of the mutation and which incorporated the amino acids 
changes to be made. These oligonucleotides were then used as primers in a PCR reaction 
producing each of the strands of the plasmid with the mutation present; the parental plasmid 
is digested with the methylation sensitive restriction endonuclease Dpnl and then 
transformed into competent Kcoli cells. 

25 Primers were applicable for the mutation of both BACE WT and BACE N->Q due to their 
high sequence homology. Seven constructs were produced; these are detailed below with 
the ohgonucleotide sequence used to make the constructs. 

1) BACE WT mutating arginine 56 to lysine and arginine 57 to lysine (SEQ ID 5) 



5' - CCCGAGGAGCCCGGCAAGAAGGGCAGCTTTGTGGAGATG-3' (SEQ ID 
30 NO:26) 
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5' - CATCTCCACAAAGCTGCCCTTCTTGCCGGGCTCCTCGGG- 3' (SEQ ID NO:27) 

2) BACE WT mutating arginine 57 to lysine (SEQ ID 7) 

5' - CCCGAGGAGCCCGGCCGGAAGGGCAGCTTTGTGGAGATGG - 3' (SEQ ID 
NO:28) 

5 5' - CCATCTCCACAAAGCTGCCCTTCCGGCCGGGCTCCTCGGG - 3 ' (SEQ ID 
NO:29) 

3) BACE WT deleting arginine 57 (SEQ ID 9) 

5' - CCCGAGGAGCCCGGCAGGGGCAGCTTTGTGGAGATGGTGGAC - 3' (SEQ ID 
NO:30) 

10 5' - GTCCACCATCTCCACAAAGCTGCCCCTGCCGGGCTCCTCGGG - 3 ' (SEQ ID 
NO:31) 

4) BACE N->Q mutating arginine 56 to lysine and arginine 57 to lysine (SEQ ID 11) 

5' - CCCGAGGAGCCCGGCAAGAAGGGCAGCTTTGTGGAGATG - 3' (SEQ ID 
NO:32) 

15 5' - CATCTCCACAAAGCTGCCCTTCTTGCCGGGCTCCTCGGG - 3' (SEQ ID NO:33) 

5) BACE N->Q mutating arginine 57 to lysine (SEQ ID 15) 

5' - CCCGAGGAGCCCGGCCGGAAGGGCAGCTTTGTGGAGATGG - 3' (SEQ ID 
NO:34) 

5' - CCATCTCCACAAAGCTGCCCTTCCGGCCGGGCTCCTCGGG - 3' (SEQ ID 
20 NO:35) 

6) BACE N->Q deleting arginine 57 (SEQ ID 17) 

5' - CCCGAGGAGCCCGGCAGGGGCAGCTTTGTGGAGATGGTGGAC - 3' (SEQ ID 
NO:36) 

5' - GTCCACCATCTCCACAAAGCTGCCCCTGCCGGGCTCCTCGGG - 3' (SEQ ID 
25 NO:37) 
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7) BACE N->Q mutating arginine 56 to lysine and arginine 57 to lysine and removing the C 
temiinal poly histidine tag (SEQ ID 13) 

5' - CCCGAGGAGCCCGGCAAGAAGGGCAGCTTTGTGGAGATG- 3' (SEQ ID 
NO:38) 

5 5' - CATCTCCACAAAGCTGCCCTTCTTGCCGGGCTCCTCGGG - 3 ' (SEQ ID NO:39) 

5' - CCACAGACAGATGAGTCATGACACCATCATCACCACTAAG - 3' (SEQ ED 
NO:40) 

5' - CTTAGTGGTGATGATGGTGTCATGACTCATCTGTCTGTGG - 3 ' (SEQ ID 
N0:41) 

10 After transformation of the plasmid the protein coding region was checked by DNA 
sequencing. 

Protein production (l^ 

Plasmid constructs were transformed into BLR(DE3) as follows: 1-2 jil DNA was added 
into 25ul BLR(DE3) competent cells. Cells were then heat shocked at 42°C for 45secs, 
15 followed by incubation for 30mins at 4''C . The sample was placed on ice for 2-3 minsi 

before addition of 125-250ul HOC medium and left for 60 mms at 37*'C. Cells were plated 
out onto agar containing carbenicillin & incubated at 37°C for 16h. Transformations were 
stored at 4°C. Transformed cells could be used up to after 8 weeks storage. 

Colonies were inoculated in 100 ml LB broth with ImM carbenicillin, and shaken for 16h at 
20 25°C. 12 ml of this culture was added to 1 L of the same medium in baffle flasks. The 
typical total culture volume was 12, 20 or 24 L. Cells were induced by addition of ImM 
IPTG at approximately OD600 1 .0. Cells were harvested 3 to 4 hoxirs after induction by 
centrifiigation for 7 min at 16 000 g. Cell pellets were resuspended in 1 litre TN buffer 
(150mM NaCl, 50mM Tris, pH 7.5) before addition of 10 mg lysozyme per litre of 
25 bacterial culture. The suspension was left for 20 xnins under vigorous stirring then frozen at 
-70°C. 



The lysates were thawed & adjusted to 1 mM MgC12 and 20 pi 10 mg/ml DNAse, incubated 
30-60 mins at 20*C, then 0.1 % Triton X-100 was added. Inclusion body washes were 
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performed in 11 wash steps, spun down at 13,000-16,000 g for 20mins at room temperature 
then resuspended by sonication in TNT buffer (TN buffer + 0.1% Triton 100). The washing 
step with TNT was repeated at least three times (up to seven times) until an almost 
homogenous dark cream precipitate was obtained. At this stage the pellet was washed twice 
5 with TN buffer. The typical yield for a 12 L culture of BACE WT constructs was 4.5 g 
washed inclusion body material. 

Protein Refolding fn 

Each g of inclusion bodies was solubilised with 22.5 ml of 8 M urea, 50 mM Tris, 0.1 M 
beta-mercaptoethanol, 10 mM DTT, 1 mM EDTA. After 2 to 3 hours under gentle stirring, 
10 this was spun at 48 400 g for 25mins. This was then diluted 1 in 10 in 8 M Urea, 0.2 mM 
oxidized glutathione, 1.0 mM reduced glutathione. This is the starting solution for refolding 

Refolding was accomplished by dilution into 20 volumes 20 mM Tris, 10 mM NDSB256 
(3-(benzyldimethylammomo)propanesulfonate). The addition was achieved by slowly 
dripping from a burette into a strongly stirred solution. Addition was carried out at room 
15 temperature. 

The pH was adjusted to approximately 9 using 13.5 ml 1 N HCl per 5 litre of refolding mix 
either immediately after dilution or 16 h after dilution. This was left at 4**C for 2-3 weeks. 
The refolding mix was then adjusted to pH 8.2 16h before concentratmg. In instances where 
a longer incubation was applied it appeared that yields were slightly better. No precipitation 
20 was seen when attempting to refold BACE, even in totally unsuccessful conditions. 
Constructs BACE WT R57K, BACE WT R57DEL, BACE N->Q R57K, and BACE 
R57DEL refolded with lower yields. 

Protein Purification of BACE firom refolding step m 

The refolded protein sample was concentrated by ultrafiltration using two parallel Vivaflow 
25 200 cells (MWCO 30Kda), fed by a single pump. The concentration factor was not more 
than 200 times: if exceeded, precipitation occurred. 

Concentrated refolded BACE was loaded and eluted on a 1.75 L Sephacryl 300 column run 
at a flow of 0.2 cm-l/min in 0.4 M Urea, 20 mM Tris, 10 mM HCL Typical loading volume 
was 2% bed volume. From reconcentrated material three peaks are observed, the fibrst one 
30 near the void volume (large aggregates), which merges into a second peak of aggregated 
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inactive material. The third peak (elutes at approx 40% of column volume) constitutes 
active BACE, For BACE WT constructs, the active fraction elutes at approximately 800ml. 

Activation bv Clostripain (1) 

Clostripain (Cp; EC 3.4.22.8, from Worthington or Sigma C7403) v^as activated before use 
5 by solubilising the freeze dried material to 1.25 mg/ml in: 20 mM Calcium Acetate, 8 mM 
DTT, 100 mM Tris, pH 8 at 1 .25 mg/ml 4 °C for at least Ih. The preparation was then 
stable at 4 ®C for up to four weeks. 

The third peak (typically 100 ml at an average of 0.3 mg ml) from Sephacryl 300 elution 
was treated with activated Cp, (1/100 dilution) for between 30-90mins at room temperature. 

10 Activation of BACE WT R56KR57K, BACE N->Q R56KR57K & BACE N->Q 

R56KR57K no His by clostripain was performed as described above except that prior to 
activation the solution was concentrated ten fold using Vivaspin 20 ml 30 KDa MWCO. 

The reaction was stopped by loading onto a Mono Q HR5-5 colxmm equilibrated in 0.4 M 
Urea, 20 mM Tris, 10 mM HCl, 1 mM EDTA followed by washing using the same buffer. 
15 The protein was eluted with a 0 to 1 M NaCl gradient over 1 0 column volumes. A typical 
final yield of active soluble BACE WT R56KR57K is 1-2 mg of protehi per litre of culture 
grown. The eluted protein was characterised and used in crystallisation assays. 

Protein Production (2^ 

BLR (DE3) competent cells were transformed as described earlier and plated onto agar 
containing ampicillin (Amp). A colony was picked into 250ml LB + lOOug/ml Amp and 
grown overnight @ 37^C, ISSrpm. Following overnight growth (ODeoo varied between 2.0- 
2.5) 10ml of this culture was used to inoculate IL of fresh LB+100 |xg/ml Amp in a 2L 
baSled flask. Routinely 24L of fresh LB+Amp would be inoculated from the overnight 
growth. Following inoculation, the 24L prep would be grown at 37^C, 1 85rpm until an 
ODeoo =1.0 was obtained. Protein expression was induced by the addition of EPTG to a final 
concentration of ImM. Cultures were incubated for a fiirther 3 hours (at 37^C, ISSrpm) 
before harvesting by centrifiigation at 8000 rpm for 10 mins (JLA 8.1000). Cell pellets 
could be stored at -80^C or processed immediately. 



20 



25 



wo 2004/011641 PCT/GB2003/003200 

60 

All following protein production procedures were performed at room temperature imless 
stated otherwise. Cell pellet was re-suspended in 500ml of TN bxiffer (TN buffer - 150mM 
NaCl, 50mM Tris, pH7.5). 240mg of egg lysozyme (lOmg/L of bacterial culture) was added 
to the re-suspended pellet. The suspension was left stirring for 20mins. Following this, 
5 lOOul of DNase 1 (lOmg/ml stock) was added to the suspension and this was left stirring for 
20mins. This lysate was clarified by centrifiigation at SOOOrpm for 20mins (JLA8.1000). 
The supernatant was discarded and the pellet was re-suspended in 100ml TNT buffer (TNT 
buffer - 1 SOroM NaCl, 50mM Tris, pH7.5, 0.1% Triton X-100). Efifort was made to break 
up any lumps present in the pellet so that a homogenous re-suspension was obtained. 

10 Following this, the re-suspension was sonicated for 2 mins (20 sec pulses). 400ml of TNT 
buffer was added to bring the volume of the suspension up to -SOOmls, This was 
centriftiged for 20mins at SOOOrpm and the supernatant discarded. The re-suspension in 
TNT buffer and sonicadon steps, as described above, were repeated twice. Following these 
three TNT washes, the pellet was re-suspended in lOOml of TN buffer and sonicated for 2 

15 mins (20 second pulses). The suspension was centrifuged for 20 mins at SOOOrpm. This 
wash in TN buffer was repeated once. Approximately 12-1 5g of inclusion bodies was 
obtained from the 24L of culture. 

Protem Refolding (2) 

The inclusion body preparation was solubilised by addition of lOOmls of solubilisation 
20 buffer (Sol. Buffer - SM urea, 50mM Tris, O.IM beta-mercaptoethanol, lOmM DTT, ImM 
EDTA). Effort was made to break up the inclusion body pellet using a pipette/spatula. The 
solution was left stirring gently overnight. The suspension was centriftiged for 30 mins at 
25,000rpm (JA25). The supernatant (-lOOmls) was diluted by the addition of 900mls of SM 
urea, 0.2mM oxidised glutathione, l.OmM reduced glutathione. 

25 The IL of solubilised inclusion bodies as prepared above were refolded by a further 20x 
dilution. A 250ml aliquot of solubilised inclusion body prep was added drop-wise to 4.75L 
of refolduxg buffer (Refolding buffer - 20mM Tris, lOmM NDSB256 (3- 
(ben2yldimethylammonio)propanesulfonate). The 4.75L of refolding buffer was stirred 
vigorously (not foaming) and the 250mls of inclusion body prep was added using a 

30 peristaltic pump. Care was taken to add the 250mls at a fast drop rather than a continuous 
pour. The remaining 750mls of inclusion body prep was diluted in the same way (250mls 
into 4.75L of refolding buffer). The four 5L vessels were placed at 4^C overnight 
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Following overnight incubation at 4^C, the pH of each 5L vessel was adjusted to pH9.0 by 
addition of cone HCl. The vessels were then placed back at 4^C and left for 3 weeks. 

Protein PurijScation of BACE fro m Refoldin g Step (2) 

Two parallel Vivaflow 200 cells (MWCO 30Kda) fed by a single peristaltic pump were 
used. Each 5L of refolding mix was concentrated to -50mls. Over concentrating leads to 
precipitation and should be avoided. The concentration of 5L of refolding mix took -2 
hours. The 50mls of concentrated refolding mix was centrifliged for 25 mins, at 25,000rpm. 
The supernatant was then ready for gel filtration using a Sephacryl S-300 column (100x3.5 
). This method is limited by the volume of concentrated refolding mix than can be loaded 
onto the gel filtration column (50mls) per run. Sephacryl S-300 column was equilibrated 
with 0.4M urea, 20mM Tris, lOmM HCl (at a flow rate of 4ml/mm). 50ml of sample can be 
loaded per run. The column was run at a flow rate of 4ml/niin. SDS PAGE analysis of peaks 
1,2 and 3 showed the presence of BACE (50Kda band) however activity assay of all three 
peaks showed only active BACE in peak 3. Fractions from Peak 3 were pooled and kept on 
ice. 

Activation by Clostripain (2) 

Clostripain (Sigma C7403) was prepared by dissolving protein to a final concentration of 
1.25mg/ml in 20mM Calcium acetate, 8mM DTT, lOOmM Tris pH 8.0. The clostripain was 
activated by incubating on ice for 1 hour prior to use. 

Pooled fractions from peak 3 (-lOOml at 0.2mg/ml) were activated by the addition of 1/100 
dilution of L25mg/ml clostripain. The reaction was incubated at 37^C in a water bath for 90 
miautes. The reaction was stopped by addition of ImM EDTA and placed on ice. Note: 
With each fresh batch of Sigma Clostripain, a time trial was performed on a small amount 
of BACE to verify the length of incubation needed at 37°C. The length of incubation varied 
from 30-90 mins. Analysis by SDS PAGE clearly showed the appearance of the lower 
molecular weight activated species (~47Kda) from the larger inactivated species (-'SOKda). 

A Mono Q 5/5 ion exchange column was pre-equilibrated in 0.4M urea, 20mM Tris, lOmM 
HCl. The activated BACE (-SOmls at -0.2mg/ml) was loaded onto the Mono Q column at a 
flow rate of 1 .Oml/min. Activated BACE was purified by applying a linear salt gradient 
(0.4M urea, 20mM Tris, lOmM HCl, l.OM NaCl) over 20 columin volumes. Following 
analysis by SDS PAGE and subsequent activity assay, fractions corresponding to activated 
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BACE were pooled and buffer exchanged into crystallisation buffer (20mM Tris, pH8.2, 
ISOmMNaCl, ImMDTT). 

Protein Purification of BACE from Refolding Step (3^ 

By using method 3 in conjunction with the S-200 INDEX gel filtration column, all 20L of 
refolding mix could be processed in one go. 

A Sartocon filtration cassette (MWCO 30Kda) was used in conjunction with a Watson 
Marlow 623 S high speed pump. This assembly was set up as described in the manufactures 
operation manual. The 20L of refolding mix was concentrated down to --SOOmls in less than 
1 hour. Due to the dead volume in the assembly tubing, the volume could not be reduced 
further. At this stage the SOOmls of concentrated refolding mix was filtered using a 0.2um 
filter. The filtered sample was then ready for gel filtration ixsing an S-200 INDEX gel 
filtration colunm (100x10.0). A S-200 INDEX column pre-equilibrated in 0.4M urea, 
20mm Tris, lOmM HCl was used. The column run was at a flow rate of lOmls/min. 

SDS analysis of peaks 1 ,2 and 3 showed that BACE was present in all fractions. Activity 
assay showed that only peak 3 contain some BACE activity. Fractions from peak 3 were 
pooled (~250mls at O.lmg/ml). 

Prior to clostripain activation, the BACE sample was concentrated using a Resource Q ion 
exchange column. A 6/1 Resource Q colunm was pre-equilibrated in 0.4M urea, 20mM 
Tris, lOmM HCL The Bace sample was loaded onto the column at 7ml/min. BACE was 
eluted off the column using a linear salt gradient (0.4M xirea, 20mM Tris, lOmM HCl, IM 
NaCl) over 5 column volumes. This step has the effect of dramatically reducing the sample 
volimie size. Prior to clostripain activation, the protein sample is diluted with 0.4M urea, 
20mM Tris, lOmM HCl to reduce the salt concentration to enable fiarther purification using 
Mono Q. A dilution factor of 5:1 has been used successfiilly. 

This is then followed by Clostripain Activation and Mono Q purification as outlined above. 
Protein Characterization 

The quality of the final preparation was evaluated by: 

(a) SDS volvacrvlamide sel electrophoresis, performed using commercial gels (Novagen) 
followed by Coomassie Brilliant Blue staining according to the manufacturer's instructions. 
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The purity as estimated by scanning a digital image of a gel was estimated to be at least 
95%. 

(b) Mass Spectroscopy : The eluted peak(s) were analysed using ESI-TOF-MS. Mass 
spectroscopy was performed using a Bniker '*BioTOF" electrospray time of flight 
instrument. Samples were either diluted by a factor of 1000 straight from storage buffer into 
methanol/water/fomiic acid (50:48:2 v/v/v), or subjected to reverse phase HPLC separation 
using a C4 column. Calibration was achieved using Bombesin and angiotensin I using the 
2+ and 1+ charged states. Data were acquired between 200 and 2000m/z range and were 
subsequently processed usmg Bruker's X-mass program. Mass accuracy was typically 
below 1 m 10 000, 

MS Analvsis of EACH WT R56KR57K rSEO ID NO:6) 
Full-length protein: MASMTGGQQMGRGSMAGVLPAHGT... 

Predicted mass of full-length protein: 50147 

Cleavage position: 

MASMTGGQQMGR 4^ GSMAGVLPAHGT. . . 

Predicted mass of BACE protein: 489 1 1 . This is the first intermediate fragment and is 
obtained very quickly and can be obtained as a stable fragment at lower enzyme 
concentration. 

Cleavage position: 

MASMTGGQQMGRGSMAGVLPAHGTQHGIRLPLRSGLGGAPLGLR >l 
LPRETDEEP... 

Predicted mass of BACE protein: 45781. This is the final fragment obtained in the 
conditions described above. Observed ES-MS spectra of this fi:agment deconvolutes to a 
parent mass of 45783. The fr^ment typically elutes as a single peak from the Mono Q 5.5. 

Mass Spec Analvsis of BACE N->0 R56KR57K (SEP ID NO:12^ 
Predicted mass of fiiU-length protein: . 50895 
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MASMTGGQQMGRGSMAGVLPAHGTQHGIRLPLRSGLGGAPLGLR 4. 
LPRETDEEP... 

Predicted mass of BACE protein: 46660.65. This is the final fragment obtained in the 
5 conditions described above. Observed ES-MS spectra of this fragment deconvolutes to a 
parent mass of 46655. The fragment typically elutes as two peaks from the Mono Q 5.5, the 
first corresponding to the desired fragment. 

Mass Spec Analysis of BACE N->0 R56KR57K no His (SEP ID NO: 14^ 
Predicted mass of full-length protein: 50072.73 

1 0 Cleavage position: 

MASMTGGQQMGRGSMAGVLPAHGTQHGIRLPLRSGLGGAPLGLR >l. 
LPRETDEEP... 

Predicted mass of BACE protein: 45837.80. This is the first intermediate fragment, 
obtained rapidly between 30-60 minutes post activation and is suitable for crystallisation. 
15 Observed ES-MS spectra of this fragment deconvolutes to a parent mass of 45838.30. 

Typically elutes as 2 peaks from the Mono Q 5.5, the first peak corresponding to the desired 
fragment. 

Cleavage position: 

MASMTGGQQMGRGSMAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEEPEE 
20 PGK >!. KGSFVEMV. . . 

Predicted fragment mass: 44230. 1 1 . Further digestion beyond 60 minutes promotes the 
formation of the above fragment, not suitable for crystallisation. Observed ES-MS spectra 
of this fragment deconvolutes to a parent mass of 44228.03. This typically elutes as peak 2 
from the Mono Q 5.5. 

25 Method for Determinin g Activity of BACE 

A fluorimetric assay was used to measure the activity of the refolded proteins. Activity of 
the BACE enzyme was measured using the fluorescent peptide R-E(EDANS)-E-V-N-L-*D- 
A-E-F-K(DABCYL)-R-OH (Bachem) as substrate. Assays were carried out in 96-well 
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' black, flat-bottomed Cliniplates in a final assay volume of 1 OOul. The reaction rate was 
monitored at room temperature on a Fluoroskan Ascent plate reader with excitation and 
emission wavelengths of 355nm and 530nm respectively. 

To determine the pH profile for the enzyme 8 nM B ACE was incubated with 1 0 jiM 
5 substrate in 50 mM sodium acetate (pH 3.5-5.5) or MES (pH 5.5-6.5) buffers at varying pHs 
and5%DMSO. 

For kinetic characterization of the enzyme 8 nM BACE enzyme was incubated with varying 
concentrations of the substrate (2.5 - 80 pM) in 50 mM sodium acetate, pH 5, 5 % DMSO 
and the reaction monitored as described above. Kinetic parameters were determined by the 
10 standard Michaelis-Menten equation, using Prizm (GraphPad) software. ImM OM 99 
completely inhibits activity. 

Protein Crystallisation 

The sample of BACE was buffer exchanged into 20 mM Tris.HCl pH8.2, 150 mM NaCl, 1 
mM DTT and concentrated down to approximately 7 mg/ml as determined by its theoretical 
15 extinction coefficient Prior to crystallisation, the sample was spun at 55,000 rpm for 30 
min using a Beckman benchtop ultracentrifuge. DMSO was added to a final concentration 
of 3 % (v/v). 

Crystals of BACE from BACE WT R56KR57K, BACE N->Q R56KR57K & BACE N->Q 
R56KR57K no His were obtained by the hanging-vapour difiusion method at 20 ^'C using 
20 1 .5 |il of protein and an equivalent volume of reservoir solution. The reservoir solution 

contained 20-24 % PEG 5000 MME, 180-220 mM (e.g. 200 mM) ammonium iodide, 180- 
220 mM (e.g. 200 mM) tri-sodixma citrate (pH 6.4-6.6). In an altemative, the reservoir 
solution may additionally contain 2.5% v/v glycerol. 

Diffraction quality single crystals of BACE WT R56KR57K were obtained by the hanging- 
25 vapour diffusion method at 20 ''C using 1 .5 (xl of protein and an equivalent volume of 

reservoir solution. The reservour solution contained 20-22.5 % PEG 5000 MME, 180-220 
mM (e.g. 200 mM) anmionium iodide, 180-220 mM (e.g. 200 mM) tri-sodium citrate (pH 
6.4-6.6). 



Crystals appear within the first week and grow to maximum dimensions within 14 days. 
30 The crystals were hexagonal rods with approximate dimensions of 0.2 x 0.05 x 0.05 mm. 
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They belonged to the hexagonal space group P6\22 with cell parameters a = b = 103.2 A, c 
= 169.1 A and accommodate one enzyme molecule per asymmetric unit, and a solvent 
content of 66%. 

Inhibitor Soaking 

5 BACE inhibitors were dissolved in DMSO to a concentration of 500 mM and then diluted 1 
in 10 in a harvesting solution composed of 220 mM ammonium iodide, 220 mM sodium 
cacodylate pH 6,4 and 22% PEG 5K MME or 100-200 mM sodium citrate pH 5.0, 200 mM 
ammonium iodide and 30% PEG 5K MME. Apo-BACE protein crystals were transferred 
into the harvesting solution for a period of up to 24 hours prior to being dipped in 
10 cryoprotectant (20% PEG 5000 MME, 200 mM ammonium iodide, 200 mM sodium 
cacodylate pH 6.4 and 20% (v/v) glycerol or 200 mM sodium citrate pH 5.0, 200 mM 
ammonium iodide, 30% PEG 5K MME and 20% (v/v) glycerol) containing the inhibitor and 
frozen in liquid nitrogen. 

Data Collection & Processing 

15 The structure of apo-BACE was solved from BACE WT R56KR57K to 1 .75 A resolution 
using the method of molecular replacement. Prior to data collection, crystals were exposed, 
briefly, to cryoprotectant, described previously, before flash freezing. Data was collected at 
100 ^'K on beamline ID14-1 at the European Synchrotron Radiation Facility using an ADSC 
Quantum4 CCD detector, with a wavelength of 0.934A and processed using MOSFLM 

20 (LesUe, A. G. W. (1992). "bx Joint CCP4 and EESF-EACMB Newsletter on Protein 

Crystallography, vol. 26, Warrington, Daresbury Laboratory). The dataset was scaled using 
SCALA (CCP4 - Collaborative Computational Project 4. (1994) The CCP4 Suite: Programs 
for Protein Crystallography. Acta Crystallographica D50, 760-763) and the intensities 
converted to stmcture factor amplitudes with TRUNCATE (Evans, P. R. (1997). Scaling of 

25 MAD data. In Recent Advances in Phasing (ed. K. S. Wilson, G, Davies, A. W. Ashton and 
S. Bailey), pp. 97-102. Coimcil for the Central Laboratory of the Research Councils 
Daresbury Laboratory, Daresbury, UK), from the CCP4 suite of programs (CCP4 - 
Collaborative Computational Project 4. (1994) The CCP4 Suite: Programs for Protein 
Crystallography. Acta Crystallographica D50, 760-763). Statistics for the processing are 

30 shown in Table 2. 
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TABLE 2: Data collection statistics for apo-BACE. 



Resolution 


1.75 A 


Mosaicity 


0.34° 


Completeness 


95.9 % 


Multiplicity 


6.3 


Rmerge 


0.097 



Structure Determination and Refinement 

The structure of apo-BACE was solved by molecular replacement using the program EPMR 
5 (Kissinger CR, Gehlhaar DK, Fogel DB, Acta Crystallogr D Biol CrystaUogr, 1999,vol 55 
(Pt 2), 484-91). Initially, it was impossible to know whether the correct space group was 
P6i22 or P6522, therefore molecular replacement attempts were performed against botibi. 
Default parameters and a resolution range of 15-4A were used in conjunction with the A 
chain of IFKN (Hong et al, 2000) as the search model. A solution was found for P6i22 

10 with an Rfactor of 0.458 and a correlation coefficient of 0.543. In an attempt to reduce 
model bias, the molecular replacement solution was used as the starting point for 
ARP/wARP (Morris RJ, Perrakis A, Lamzin VS, Acta Crystallogr D Biol Crystallogr, 
2002,vol 58,(Pt 6 No 2), 968-75) to perform automated backbone tracing using warpNtrace 
and side chain building via the Side_dock procedure. This produced a discontinuous model 

15 composed of 244 out of 385 residues spanning 12 amino acid chains. Cycles of structural 
refinement with REFMAC5 (Murshudov, G. N., Vagin, A. A. and Dodson, E. J. (1997). 
Refinement of macromolecular structures by the maximum-likelihood method. Acta 
Crystallographica, 1997 D53, 240-255) were alternated with manual rebuilding of the 
model using QUANTA (Jones et al., Acta Crystallography A47 (1991), 1 10-1 19 and 

20 commercially available from Accelerys, San Diego, CA). The model was extended to 329 
residues with chain breaks between 156-170, 255-280 and 31 1-325. CNX (Brunger et al.. 
Current Opinion in Structural Biology ^ Vol. 8, Issue 5, October 1998, 606-61 1, and 
commercially available from Accelerys, San Diego, CA) composite omit maps were 
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generated to allow further biiilding of the structure and finally water molecules added using 
Denint (Astex internal software library). Refinement statistics are shown in Table 3. 

TABLE 3: Final refinement statistics for apo-BACE 



Rwork 


0.251 


Rfiee 


0.284 


RMS bond deviation from ideality 


0.011 


RMS bond angle deviation from ideality 


1.30 


Average Bfactor for structure 


32.99 



5 This data indicates that the final stmcture is of good quality; the Rfactors indicating that the 
refined model has a good agreement with the experimental data. The RMS deviations fi-om 
ideality indicate that the geometry of the model is good. 

Description of the Ado Structure of B ACE 

The structure of BACE we present here has been solved in the absence of substrate or 
10 inhibitor. This is the first time that such a structure has been described. The solution of this 
structure has been possible as we have, for the first time, crystallized BACE without 
compoimd in a form suitable for diffracting X-rays, and hence allowed the determination of 
the apo structure of BACE. Under our conditions it crystallizes in sp£ice group P6i22 with a 
monomer in the asymmetric imit. This is a novel crystal form of BACE. 

1 5 The protein chain has been traced in the electron density firom residue Phe47p to Alal 57, 
and then firom Alal68 to Asn385. There is no indication as to the position of residues 158 to 
167 in the electron density map. In addition to the protein atoms, the model contains 3 
iodine atoms and 285 water molecules in its present state of refinement. 

The majority of the residues in this form of BACE are well defined, the exceptions being 
20 some exposed residues. Parts of the protein surface are exposed to solvent, as a consequence 
of the molecular packing within the crystal lattice (Figure 1). Residues 255-259, 271-277 
and 310 to 3 17 are exposed and have high B-factors relative to the body of the protein. In 
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addition, residues 304 to 309 pack against an exposed loop and are poorly ordered with high 
b-factors. There are three disulphide bonds in BACE, two of these are well defined in the 
electron density, the third, between Cys269 and Cys319 has high temperature factors. This 
is probably a consequence of its proximity to exposed parts of the protein. 

5 BACE as it has been solved in this form, is a compact globular protein, which is formed by 
two domains; domain 1 being comprised of residues 47p-146 and domain 2 of residues 
(146-3 85)(numbering from Hong et al, 2000). The active site lies between these two 
domains, and contains the two conserved aspartic acid residues, Asp32 and Asp228, which 
define the active sites of aspartic proteinases, hi our structure, a single water molecule is 

10 coordinated between these two residues. 

The overall fold of the protein is similar to that of IFKN (Hong et al, 2000), with a few 
minor, but potentially significant changes. Residues 158-166 are ordered in the structure of 
BACE in the presence of OM99-2 (m the P2i form), and consist of a loop plus a short helix. 
In the P6i22 unliganded form, these residues cannot be seen, and are assumed to be mobile. 
This may be a consequence of the crystal packing arrangement in this form. Residues 69-75 
have a different arrangement in the crystal form described here, to their arrangement in the 
crystal structure of the OM99-2 complex. The residues are displaced upward relative to the 
active site in the structure without OM99-2. The two molecules can be superposed over all 
residues using the program MAPS (MAPS-Multiple Alignment of Proteins Structures 
Version 0.2, Sep-7-1999, Guoguang, Lund University, Sweden and Lu, G. An Approach for 
Multiple Alignment of Protein Structures (1998, in manuscript) to give an r.m.s.d. of 0.74 
A. This results in close alignment of the N-terminal residue prior to residue 69 and 
subsequent to 75, In contrast the CA atoms of residue 71 are displaced by 3.3 A, those of 
residue 72 by 4.3 A, and those of residue 73 by 6,0 A. (Figure 2) The reason for this 
difference is postulated to be the interaction of OM99-2 backbone residues with the protein 
residues, in an arrangement analogous to a beta sheet This interaction pulls the loop down 
over the substrate in the active site, and locks it in position. In the absence of substrate, or 
peptidic inhibitor, the loop moves back up again. 

In addition to these local changes in structure, on binding of inhibitor, there appears to be a 
30 slight shift in the domain positions relative to each other, resulting in an average difference 
in position in the C-terminal domain C A atoms of about 2.0 A, when the molecules are 
superposed using the N-terminal CA atoms. 



20 
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The symmetry of the P6i22 crystal system has resulted in a packing arrangement which 
brings part of a symmetry related molecule very close to the active site entrance of B ACE. 
Gln73 from a synmietry related molecule lies very close to the entrance to the active site of 
BAGE in this crystal form, and overlaps with the position occupied by P4 Glu in OM99-2. 
5 However, this does not interfere wdth the usefulness of this crystal system to soak in 

inhibitors, as we have shown that these crystals can be used to soak BACE inhibitors into 
the active site. 
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Equivalents 

The foregoing description details presently preferred embodiments of the present invention 
which are therefore to be considered in all respects as illustrative and not restrictive. Those 
10 skilled in the art will recognize, or be able to ascertain, using no more than routine 
experimentation, many equivalents, modifications and variations to the specific 
embodiments of the invention described specifically herein. Such equivalents, modifications 
and variations are intended to be (or are) encompassed in the scope of the foUowing 
paragraphs: 



1. A mutant BACE protein, which protein lacks one or more proteolytic cleavage sites 
recognized by clostripain (or another protease which recognizes the same cleavage 
site as clostripain). 

2. The protein of paragraph 1 wherein BACE residues R56 and/or R57 (based on 
numbering of SwissProt P56817) are mutated or deleted. 

3 . The protein of paragraph 2 wherein R56 or R57 are mutated by the substitution of 
arginine for lysine. 

4. The protein of paragraph 2 wherein R56 and R57 are mutated by the substitution of 
arginine for lysine. 

5. The protein of any one of the preceding paragraphs which comprises BACE residues 
56 to 396 (based on numbering of SwissProt P56817). 
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6. A mutant BACE protein (for example, a mutant BACE protein as defined in any one 
of the preceding paragraphs) which is truncated at the N-terminal up to and including 
R42, R45, G55, R56 or R57. 

7. The protein of any one of paragraphs 1 to 6 truncated at the C-terminal such that at 
least residues 454 etseq, are absent 

8. The protein of paragraph 7 truncated at the C-terminal such that at least residues 447 
et seq, are absent. 

9. The protein of any one of the preceding paragraphs wherein the asparagine residues 
at positions 153, 172, 223 and 354 are mutated to glutamine residues. 

10. The protein of any one of the preceding paragraphs which is xm- or deglycolsylated. 

11. A mutant BACE protein selected from: (a) SEQ ID 6; (b) SEQ ID 8; (c) SEQ ID 10; 
(d) SEQ ID 12; (e) SEQ ID 14; (f) SEQ ID 16; (g) SEQ ID 18; (h) SEQ ID 19; (i) 
SEQ ID 20; (j) SEQ ID 21. 

12. Nucleic acid encoding the protein of any one of the preceding paragraphs. 

13. A vector comprising the nucleic acid of pars^aph 12. 

14. A host cell comprising the vector of paragraph 13. 

15. A process for producing the protein of any one of paragraphs 1 to 1 1 comprising the 
steps of: (a) culturing the host cell of paragraph 14 xmder conditions suitable for 
expression of the protein; and optionally (b) isolating the expressed recombinant 
BACE protein. 

16. A process for producing refolded recombinant BACE comprising the steps of: (a) 
solubilising the recombinant BACE; (b) diluting the solubilised BACE into an 
aqueous buffer containing sulfobetaine (for example at a concentration of 10 to 50 
mM); and (c) maintaining the diluted solution at low temperature (for example, 3 to 
6'C) and at high pH (e.g. 9 to 10.5) for at least 2 weeks. 



17. 



The process of paragraph 16 wherein the recombinant BACE is produced according 
to the process of paragraph 15. 
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18. Refolded recombinant BACE produced by, or obtainable by, the process of 
paragraph 16 or paragraph 17. 

19. A process for producing a crystal of BACE comprising the step of refolding 
recombinant BACE protein according to the process of paragraph 16 or paragraph 
17. 

A process for producing a crystal of BACE comprising the step of growing the 
crystal by vapour diffusion using a reservoir buffer that contains 18-26 % PEG 5000 
MME (for example, 20-24 % PEG 5000 MME, e.g. 20-22.5 % PEG 5000 MME), 
180-220 mM (e.g. 200 mM) ammonium iodide and 180-22- mM (e.g. 200 mM) tri- 
sodium citrate (pH 6.4-6.6). 

The process of paragraph 20 wherein the BACE is recombinant and the process 
further comprises the preliminary step of refolding the recombinant BACE according 
to the process of paragraph 16 or paragraph 17. 

22. The process of any one of paragraphs 18 to 20 further comprising the step of 
activating the BACE by clostripain digestion. 

23. The process of paragraph 21 wherein the BACE is as defined in any one of 
paragraphs 1 to 10. 

24. A crystal of BACE produced by, or obtainable by, the process of any one of 
paragraphs 18 to 22. 

25. A crystal of BACE having a hexagonal space group P6i22. 

26. The crystal of paragraph 25 having unit cell dimensions of a=b=103.2 A, c=169. 1 A, 
a=P=60**, 7=120°, and a unit cell variability of 5% in all dimensions. 

27. The crystal of paragraph 25 or paragraph 26 which contains one copy of BACE in 
the asymmetric unit. 

28. A crystal of BACE (e.g. a crystal according to any one of paragraphs 24 to 27) 
having a resolution better than 3 A. 

29. The crystal of paragraph 28 having a resolution better than 2.5 A. 



20. 



21. 
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30. The crystal of paragraph 29 havmg a resolution better than 1 .8 A. 

31. A crystal of BACE (e.g. a crystal according to any one of paragraphs 24 to 30) 
comprising a structure defined by all or a portion of the co-ordinates of Table 1 , 

32. The crystal of paragraph 3 1 comprising a structure defined by a portion of the 
coordinates of Table 1 which coordinates relate to: (a) the BACE catalytic domain; 
and/or (b) a BACE active site; and/or (c) a BACE binding cavity; and/or (d) selected 
amino acid residues of a BACE binding cavity located in a protein firamework which 
holds the selected amino acids in a relative sfpatial configuration which corresponds 
to the spatial configuration of those amino acids in Table 1 ; and/or (d) a BACE 
binding site. 

33. The crystal of paragraph 32 wherein the portion of the coordinates of Table 1 
comprise (or consist essentially of) those relating to residues SER71, GLY72, 
LEU91, ASP93, GLY95, SER96, VAL130, PR0131, TYR132, THR133, GLN134, 
ILE171, ILE179, ILE187, ALA188, ARG189, PRO190, TRP258, TYR259, ASP284, 
LYS285, ASP289, GLY291, THR292, THR293, ASN294, ARG296 and ARG368 
(based on the numbering of SwissProt P56817). 

34. The crystal of paragraph 33 wherein the portion of the coordinates of Table 1 
comprise (or consist essentially of) those relating to residues LYS70, SER71, 
GLY72, GLN73, GLY74, TYR75, LEU91, VAL92, ASP93, THR94, GLY95, 
SER96, SER97, ASN98, TYR129, VAL130, PR0131, TYR132, THR133, GLN134, 
GLY135, LYS136, TRP137, LYS168, PHE169, PHE170, ILE171, ASN172, 
SER174, TRP176, GLY178, ILE179, LEU180, GLY181, ALA183, TYR184, 
ALA185, GLU186, ILE187, ALA188, ARG189, PRO190, ASP191, ASP192, 
ARG256, TRP258, TYR259, TYR283, ASP284, LYS285, SER286, ILE287, 
VAL288, ASP289, SER290, GLY291, THR292, THR293, ASN294, LEU295, 
ARG296, GLY325, GLU326, ARG368, VAL370, LYS382, PHE383, ALA384, 
ILE385, SER386, GLN387, SER388, SER389, THR390, GLY391, THR392, 
VAL393, GLY395, ALA396 and ILE447 (based on the numbering of SwissProt 
P56817). 

35. The crystal of any one of paragraphs 24 to 34 which is capable of being soaked with 
compound(s) to form co-complex structures. 
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36. The crystal of any one of paragraphs 24 to 35 which is soaked with one or more 
compound(s) to form co-complex structures. 

37. The crystal of any one of paragraphs 24 to 36 wherein the BACE is co-crystallized 
with one or more compound(s) to form co-ciystallized structures. 

38. The crystal of any one of paragraphs 24 to 35 which is an apo crystal. 

39. The crystal of any one of paragraphs 24 to 38 wherein the BACE is a wild-type 
BACE. 

40. The crystal of paragraph 39 wherein the BACE is a human BACE. 

41 . The crystal of paragraph 40 wherein the BACE is a homologue of a human BACE. 

42. The crystal of paragraph 41 wherein the homologue is an orthologue or a paralogue 
of a human BACE. 

43. The crystal of any one of paragraphs 24 to 38 wherein the BACE is a mutant and/or 
recombinant BACE. 

44. The crystal of paragraph 43 wherein the BACE: (a) lacks the BACE transmembrane 
and/or cytoplasmic domain(s); and/or (b) lacks one or more glycolsylation sites; 
and/or (c) comprises one or more peptide tags (for example a his tag); and/or (d) 
lacks one or more protease cleavage site(s); and/or (e) is truncated at the N-terminus; 
and/or (f) is truncated at the C-termmns; and/or (f) lacks the BACE pro-sequence. 

45. The crystal of paragraph 44 wherein the BACE mutant lacks one or more clostripain 
cleavage sites. 

46. The crystal of paragraph 45 wherein BACE residues R56 and/or R57 (based on 
numbering of SwissProt P56817) are mutated or deleted. 

47. The crystal of paragraph 46 wherein R56 or R57 are mutated by the substitution of 
arginine for lysine. 



48. 



The crystal of paragraph 46 wherein R56 and R57 are mutated by the substitution of 
arginine for lysine. 
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49. The crystal of any one of paragraphs 43 to 48 wherein the BACE mutant is truncated 
at the N-terminal up to and including R42. 

50. The crystal of any one of paragraphs 43 to 49 wherein the BACE mutant is truncated 
at the C-terminal such that at least residues 396 et seg. are absent. 

5 1 . The crystal of paragraph 50 wherein the BACE mutant is truncated at the C-terminal 
such that at least residues 387 et seq, are absent. 

52. The crystal of any one of paragraphs 43 to 51 wherein the asparagine residues at 
positions 153, 172, 223 and 354 of the BACE mutant are mutated to glutamine 
residues. 

53. The crystal of any one of paragraphs 24 to 52 wherein the BACE is un- or 
deglycolsylated. 

54. The crystal of paragraph 43 wherein the BACE mutant is selected from: (a) SEQ ID 
19; (b) SEQ ID 20; (c) SEQ ID 21, 

55. The process of any one of paragraphs 19 to 23 wherein the process produces a crystal 
of BACE as defined in any one of paragraphs 24 to 54. 

56. A three-dimensional representation of BACE or of a portion of BACE, wiiich 
representation comprises all or a portion of the coordinates of Table 1. 

57. The three-dimensional representation of paragraph 56 which is a model constructed 
fcom all or a portion of the coordinates of Table 1. 

58. The model of paragraph 57 wherein the portion of BACE is a BACE binding cavity 
and the portion of the coordinates of Table 1 comprise those of atoms defining a 
binding site within the binding cavity (for example, wherein the coordinates are as 
defined in paragraph 33 or paragraph 34). 

59. A three-dimensional representation of a compound which fits the model of paragraph 
57 or paragraph 58. 

60. The three-dimensional representation of paragraph 59 which is a model of the 
compound. 
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61 . The model of paragraph 60 wherein the compound is a pharmacophore. 

62. The model of any one of paragraphs 57, 58, 60 or 61 which is: (a) a wire-frame 
model; (b) a chicken-wire model; (c) a ball-and-stick model; (d) a space-filling 
model; (e) a stick-model; (f) a ribbon model; (g) a snake model; (h) an arrow and 
cylinder model; (i) an electron density map; (j) a molecular surface model. 

63. The model of any one of paragraphs 57, 58, 60, 61 or 62 which is in physical form. 

64. The model of any one of paragraphs 57, 58, 60, 61 or 62 which is in electronic form. 

65. The model of paragraph 64 which comprises a graphical image display on a 
computer screen. 

66. A computer-based method for the analysis of the interaction of a molecular structure 
with a BACE structure of the invention, which comprises: (a) providing a BACE 
model as defined in paragraph 57, 58 or 62 to 65; (b) providing a molecular stracture 
to be fitted to said BACE model; and (c) fitting the molecular structure to the BACE 
model to produce a compoxmd model as defined in paragraph 60, 61 or 62 to 65. 

67. A computer-based method for the ansilysis of the interaction of a molecular structure 
with a BACE structure of the invention, which comprises: (a) providing the structure 
of a BACE as defined by the coordinates of Table 1; (b) providing a molecular 
structure to be fitted to said BACE structure; and (c) fitting the molecular structure to 
the BACE structure of Table 1. 

68. A computer-based method for the analysis of molecular structures which comprises: 
(a) providing the coordinates of at least two atoms of a BACE structure as defined in 
Table 1 ("selected coordinates"); (b) providing the structure of a molecular structure 
to be fitted to the selected coordinates; and (c) fitting the structure to the selected 
coordinates of the BACE structure. 

69. The method of paragraph 68 wherein the selected coordinates represent a binding 
pocket. 

70. The method of paragraph 68 or paragraph 69 wherein the selected coordinates are of 
at least 5, 10, 50 or 100 atoms. 
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7 1 . The method of paragraph 69 or paragraph 70 wherein the selected coordinates are as 
defined in paragraph 33 or paragraph 34. 

72. A computer-based method of rational drug design comprising the method of any one 
of paragraphs 66 to 71. 

73. A computer-based method of rational drug design comprising comprising: (a) 
providing the coordinates of at least two atoms of a BACE structure as defined in 
Table 1 ("selected coordinates"); (b) providing the structures of a plurality of 
molecular fi:^gments; (c) fitting the structure of each of the molecular firagments to 
the selected coordinates; and (d) assembling the molecular firagments into a single 
molecule to form a candidate modulator molecule. 

74. A method for identifying a candidate modulator (e.g, candidate inhibitor) of BACE 
comprising the steps of: (a) emplo)dng a three-dimensional structure of BACE, at 
least one sub-domain thereof, or a plurality of atoms thereof, to characterise at least 
one BACE binding cavity, the three-dimensional structure being defined by atomic 
coordinate data according to Table 1; and (b) identifying the candidate modidator by 
designing or selecting a compound for interaction with the binding cavity. 

75. The method of paragraph 74 wherein the three-dimensional structure of BACE is a 
model as defined in paragraph 57 or paragraph 58. 

76. A method for identifying an agent compound (e.g. an inhibitor) which modulates 
BACE activity, comprising the steps of: (a) employing three-dimensional atomic 
coordinate data according to Table 1 to characterise at least one (e.g. a plurality of) 
BACE binding site(s); (b) providing the structure of a candidate agent compoimd; (c) 
fitting the candidate agent compoimd to the binding sites; and (d) selecting the 
candidate agent compound. 

77. The method of paragraph 76 wherein in step (a) the three-dimensional atomic 
coordinate data are employed to create a model as defined in paragraph 57, 58 or 62 
to 65. 



78. 



The method of any one of paragraphs 73 to 77further comprising the step of: (a) 
obtaining or synthesising the candidate agent or modvilator; and (b) contacting the 
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candidate modulator with BACE to determine the ability of the candidate modulator 
to interact with BACE. 

79. A method of assessing the ability of a candidate modulator to interact with BACE 
which comprises the steps of: (a) obtaining or synthesising said candidate modulator; 
(b) forming a crystallized complex of BACE and said candidate modulator; and (c) 
analysing said complex by X-ray crystallography or NMR spectroscopy to determine 
the ability of said candidate modulator to interact with BACE. 

80. A method for determining the structure of a compoimd bound to BACE, said method 
comprising: (a) mixing BACE with the compound to form a BACE-compound 
complex; (b) crystallizing the BACE-compound complex; and (c) determining the 
structure of said BACE-compound(s) complex by reference to the data of Table 1. 

81. A method for determining the structure of a compound bound to BACE, said method 
comprising: (a) providing a crystal of BACE; (b) soaking the crystal with one or 
more compound(s) to form a complex; and (c) determining the structure of the 
complex by employing the data of Table 1. 

82. A method of determining the three dimensional structure of a BACE homologue or 
analogue of unknown structure, the method comprising the steps of: (a) aligning a 
representation of an amino acid sequence of the BACE homologue or analogue with 
the amino acid sequence of the BACE of Table 1 to match homologous regions of 
the amino acid sequences; (b) modelling the structure of the matched homologous 
regions of said target BACE of unknown stmcture on the corresponding regions of 
the BACE structure as defined by Table 1; and (c) detennining a conformation for 
the BACE homologue or analogue which substantially preserves the stmcture of said 
matched homologous regions. 

83. The method of paragraph 82 wherein steps (a) and/or (b) and/or (c) are performed by 
computer modelling. 

84. A method of providing data for generating structures and/or performing mtional drug 
design for BACE, BACE homologues or analogues, complexes of BACE with a 
potential modulator, or complexes of BACE homologues or analogues with potential 
modulators, the method comprising: (i) establishing communication with a remote 
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device containing computer-readable data comprising at least one of: (a) atomic 
coordinate data according to Table 1, said data defining the three-dimensional 
structure of BACE, at least one sub-domain of the three-dimensional structure of 
BACE, or the coordinates of a plurality of atoms of B ACE; (b) structure factor data 
for BACE, said structure factor data being derivable from the atomic coordinate data 
of Table 1 ; (c) atomic coordinate data of a target BACE homologue or analogue 
generated by homology modelling of the target based on the data of Table 1; (d) 
atomic coordinate data of a protein generated by interpreting X-ray crystallographic 
data or NMR data by reference to the data of Table 1 ; and (e) structure factor data 
derivable from the atomic coordinate data of (c) or (d); and (ii) receiving said 
computer-readable data firom said remote device. 

85. A computer system containing one or more of: (a) atomic coordinate data according 
to Table 1, said data defining the three-dimensional structure of BACE or at least 
selected coordinates thereof; (b) structure factor data (where a structure factor 
comprises the amplitude and phase of the diffracted wave) for BACE, said stmcture 
factor data being derivable from the atomic coordinate data of Table 1; (c) atomic 
coordinate data of a target BACE protein generated by homology modelling of the 
target based on the data of Table 1; (d) atomic coordinate data of a target BACE 
protein generated by interpreting X-ray crystallographic data or NMR data by 
reference to the data of Table 1 ; or (e) structure factor data derivable from the atomic 
coordinate data of (c) or (d). 

86. The computer system of paragraph 85 comprising: (i) a computer-readable data 
storage medivim comprising data storage material encoded with the computer- 
readable data; (ii) a worldng memory for storing instructions for processing said 
computer-readable data; and (iii) a central-processing unit coupled to said working 
memory and to said computer-readable data storage medium for processing said 
computer-readable data and thereby generating structures and/or performing rational 
drug design. 

87. The computer system of paragraph 86 ftirther comprising a display coupled to said 
central-processing unit for displaying said structures. 
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88. A computer-readable storage medium, comprising a data storage material encoded 
with computer readable data, wherein the data are defined by all or a portion of the 
structure coordinates of B ACE of Table 1, or a homologue of B ACE, wherein said 
homologue comprises backbone atoms that have a root mean square deviation firom 
the backbone atoms (nitrogen-carbona-carbon) of Table 1 of not more than 1.5 A. 

89. A computer-readable data storage medium comprising a data storage material 
encoded vsdth a first set of computer-readable data comprising a Fourier transform of 
at least a portion (e.g. selected coordinates as defined herein) of the structural 
coordinates for BACE according to Table 1; which, when combined with a second 
set of machine readable data comprising an X-ray diflBraction pattem of a molecule or 
molecular complex of unknown structure, using a machine programmed v^th the 
instructions for using said first set of data and said second set of data, can determine 
at least a portion of the stmcture coordinates corresponding to the second set of 
machine readable data. 

90. A computer readable medium with at least one of: (a) atomic coordinate data 
according to Table 1 recorded thereon, said data defining the three-dimensional 
structure of BACE, or at least selected coordinates thereof; (b) stmcture factor data 
for BACE recorded thereon, the stmcture factor data being derivable fix)m the atomic 
coordinate data of Table 1 ; (c) atomic coordinate data of a target BACE protein 
generated by homology modelling of the target based on the data of Table 1; (d) 
atomic coordinate data of a BACE-ligand complex or a BACE homologue or 
analogue generated by interpreting X-ray crystallographic data or NMR data by 
reference to the data of Table 1 ; and (e) structure factor data derivable from the 
atomic coordinate data of (c) or (d). 

9L A method for determining the stmcture of a protein, which method comprises; 

providing the co-ordinates of Table 1, and either (a) positioning the co-ordinates in 
the crystal unit cell of said protein so as to provide a stracture for said protein or (b) 
£issigning NMR spectra Peaks of said protein by manipulating the coordinates of 
Table 1. 

92. A process for producing a medicament, pharmaceutical composition or dmg, the 
process comprising: (a) identifying a BACE modulator molecule according to the 
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method as defined in any one of paragraphs 73 to 79; (b) optimising the structure of 
the modulator molecule; and (c) preparing a medicament, pharmaceutical 
composition or drug containing the optimised modulator molecule. 

93. A medicament, pharmaceutical composition or drag produced by, or obtainable by, 
the process of paragraph 92. 

94. A compound identified, produced or obtainable by the process or method of any one 
of paragraphs 73 to 79. 

95. A pharmaceutical composition, medicament, drag or other composition comprising 
the compound of paragraph 94. 

96. The medicament, pharmaceutical composition or drag of paragraph 93, compound of 
paragraph 94 or composition of paragraph 95 for use in medicine, for example for 
use in therapy or prophylaxis. 

97. The medicament, pharmaceutical composition, drag or composition of paragraph 96 
wherein the therapy or prophylaxis comprises inhibiting BACE or the production of 
Ap or firagments thereof or the treatment of Alzheimer's disease. 

98. A method of inhibiting BACE or the production of AP or firagments thereof or 
treating Alzheimer's disease comprising administering the medicament, 
pharmaceutical composition, drag or composition of paragraph 96 to the patient 

99. The method of paragraph 84, wherein the computer readable data is transmitted form 
the remove device, 

100. The method of paragraph 99, wherein the data is transmitted electronically or 
optically. 
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TABLE 1 



ATOM 


1 


N 


PHE A 


47d 


65.730 


61. 598 


-17.857 


1.00 56.68 




N 


ATOM 


2 


CA 


PHS A 


47o 


66.426 


61.383 


-16.552 


1.00 54.16 




c 


ATOM 


3 


c 


PHE A 


47p 


67 .801 


60.738 


-16.734 


1.00 54 30 




c 


ATOM 


4 


0 


PHE A 


47d 


68 .258 


59 . 983 


-15 . 869 


1.00 52 46 




Q 


ATOM 


5 


CB 


PHE A 


47p 


65 ,566 


60 , 500 


-15 . 635 


1.00 54 61 






ATOM 


6 


CG 


PHE A 


47p 


64 .161 


61 . 008 


-15. 429 


1.00 54,65 




Q 


ATOM 


7 


GDI 


PHE A 


47d 


63 . 110 


60 . 508 


-16. 186 


1 00 56 21 




Q 


ATOM 


8 


CD2 


PHE A 


47p 


63 ,887 


61 . 970 


-14 .463 


1.00 55.01 


ft 


Q 


ATOM 


9 


CEl 


PHE A 


47d 


61 .812 


60 . 972 


-15 . 995 


1 . 00 57.39 


ft 


Q 


ATOM 


10 


CB2 


PHE A 


47p 


62 .596 


62 , 435 


-14 ,266 


1.00 56.06 


ft 


Q 


ATOM 


11 


CZ 


PRE A 


47p 


61 .556 


61 . 938 


-15.035 


1.00 56.47 


ft 


Q 


ATOM 


12 


M 


VAIi A 


48p 


68 . 468 


61 .048 


-17.845 


1.00 54.26 


ft 


N 


ATOM 


13 


CA 


VAL A 


48p 


69.737 


60.395 


-18.200 


1.00 54.45 


ft 


Q 


ATOM 


14 


c 


VAL A 


48p 


70.910 


60.742 


-17 .276 


1.00 53.21 


ft 


Q 


ATOM 


15 


0 


VAL A 


48p 


71 .847 


59 , 947 


-17 . 128 


1.00 56.35 


ft 


Q 


ATOM 


16 


CB 


VAL A 


48p 


70.156 


60 . 691 


-19. 662 


1.00 57.43 


ft 




ATOM 


17 


CGI 


VAL A 


48p 


69.222 


59 . 972 


-20. 636 


1.00 58.42 


ft 


Q 


ATOM 


18 


CG2 


VAL A 


48p 


70.204 


62 .208 


-19.944 


1.00 57.43 


ft 


c 


ATOM 


19 


N 


GLU A 


1 


70.860 


61 . 925 


-16.668 


1.00 49.17 


ft 


N 


ATOM 


20 


CA 


GLU A 


1 


71.845 


62 . 329 


-15.674 


1.00 46.84 


ft 


c 


ATOM 


21 


C 


GLU A 


1 


71. 857 


61.373 


-14 .479 


1.00 42.66 


ft 


c 


ATOM 


22 


0 


GLU A 


1 


72.901 


61.125 


-13.891 


1.00 45.10 


ft 


o 


ATOM 


23 


CB 


GLU A 


1 


71.532 


63.740 


-15.171 


1.00 48.32 


A 




ATOM 


24 


CG 


GLU A 


1 


70.180 


64 .053 


-14 .545 


0.00 50.15 


ft 


Q 


ATOM 


25 


CD 


GLU A 


1 


68.942 


64 .394 


-15 .351 


0.00 51.10 


ft 


Q 


ATOM 


26 


OEl 


GLU A 


1 


68 .516 


63 .562 


-16.178 


0.00 51.29 


ft 


o 


ATOM 


27 


0E2 


GLU A 


1 


68.395 


65 .500 


-15 .155 


0.00 51.61 


ft 


o 


ATOM 


28 


N 


MET A 


2 


70.685 


60 .855 


-14 .125 


1.00 37.18 


ft 


N 


ATOM 


29 


CA 


MET A 


2 


70.525 


60 . 001 


-12 .942 


1,00 32.72 


A 


Q 


ATOM 


30 


c 


MET A 


2 


70 .875 


58 . 531 


-13 . 154 


1.00 29.50 


ft 


Q 


ATOM 


31 


0 


MET A 


2 


71 .014 


57 . 787 


-12 , 183 


1.00 29.19 


ft 


o 


ATOM 


32 


CB 


MET A 


2 


69,099 


60. Ill 


-12 .415 


1.00 30.14 


ft 


Q 


ATOM 


33 


CG 


MET A 


2 


68 -733 


61 .514 


-12 .005 


1.00 34.84 


ft 


Q 


ATC»4 


34 


SD 


MET A 


2 


67 .103 


61 .723 


-11 .322 


1.00 36.26 


ft 


3 


ATOM 


35 


CE 


MET A 


2 


66.607 


63 .243 


-12 ,134 


1.00 40.05 


ft 


Q 


ATOM 


36 


N 


VAL A 


3 


71.008 


58 .079 


-14 .396 


1.00 28.21 


ft 


N 


ATOM 


37 


CA 


VAL A 


3 


71 .291 


56. 669 


-14 . 611 


1.00 29 18 


ft 


Q 


ATOM 


38 


C 


VAL A 


3 


72.690 


56.364 


-14 .085 


1.00 27.28 


ft 




ATOM 


39 


0 


VAL A 


3 


73 . 622 


57 . 149 


-14 .298 


1.00 28.33 


ft 


o 


ATOM 


40 


CB 


VAL A 


3 


71.137 


56,248 


-16 .094 


1.00 32.19 


ft 


c 


ATOM 


41 


CGI 


VAL A 


3 


71, 649 


54 . 826 


-16.299 


1,00 30.92 


ft 


Q 


ATOM 


42 


CG2 


VAL A 


3 


69.667 


56.353 


-16.525 


1.00 32.71 


ft 


c 


ATOM 


43 


N 


ASP A 


4 


72.803 


55.254 


-13 .359 


1.00 28.19 


ft 


N 


ATOM 


44 


CA 


ASP A 


4 


74.066 


54 .739 


-12.825 


1.00 29.50 


ft 


c 


ATOM 


45 


C 


ASP A 


4 


74.600 


55.632 


-11.703 


1.00 27.86 


ft 


c 


ATOM 


46 


0 


ASP A 


4 


75.797 


55.682 


-11.454 


1.00 28.77 


ft 


o 


ATOM 


47 


CB 


ASP A 


4 


75.107 


54.575 


-13, 940 


1.00 32.06 


ft 




ATOM 


48 


CG 


ASP A 


4 


76.254 


53. 655 


-13 . 553 


1.00 37.52 


ft 




ATOM 


49 


ODl 


ASP A 


4 


76.029 


52.572 


-12 . 945 


1 00 38 24 


ft 


Q 


ATOM 


50 


0D2 


ASP A 


4 


77.438 


53 . 952 


-13 . 829 


1.00 45.15 


ft 


Q 


ATOM 


51 


N 


ASN A 


5 


73 . 694 


56.308 


-11 . 015 


1 00 24 98 


ft 


N 


ATOM 


52 


CA 


ASN A 


5 


74 . 062 


57 . 172 


-9.87 6 


1 00 Ifl <3=i 


A 


Q 


ATOM 


53 


C 


ASN A 


5 


74 .270 


56.415 


-8 . 544 


1 00 22 40 


ft 




ATOM 


54 


0 


ASN A 


5 


74 . 564 


57 .045 


-7 . 515 


1.00 21.31 


ft 


o 


ATOM 


55 


CB 


ASN A 


5 


73.064 


58.329 


-9 .718 


1.00 21.03 


ft 


c 


ATOM 


56 


CG 


ASN A 


5 


71.677 


57.870 


-9.366 


1.00 16.73 


ft 


c 


ATOM 


57 


ODl 


ASN A 


5 


71.424 


56.673 


-9 .325 


1.00 19.74 


ft 


o 


ATOM 


58 


ND2 


ASN A 


5 


70.801 


58.808 


-9.035 


1.00 21.06 


ft 


N 


ATOM 


59 


N 


LEU A 


6 


74 .099 


55.098 


-8 .562 


1.00 15.94 


ft 


N 


ATOM 


60 


CA 


LEU A 


6 


74 .323 


54.236 


-7 .397 


1.00 16.57 


ft 


c 


ATOM 


61 


C 


LEU A 


6 


75. 531 


53.321 


-7 .510 


1.00 21.72 


ft 


c 


ATOM 


62 


O 


LEU A 


6 


75.855 


52.780 


-8.581 


1,00 21.55 


A 


o 


ATOM 


63 


CB 


LEU A 


6 


73.109 


53.352 


-7.078 


1.00 18.17 


A 


C 


ATOM 


64 


CG 


LEU A 


6 


71.707 


53.957 


-6.866 


1.00 19.32 


A 


C 


ATOM 


65 


CDl 


LEU A 


6 


70.695 


52.916 


-6.521 


1.00 17.46 


A 


C 


ATOM 


66 


CD2 


LEU A 


6 


71.748 


54.997 


-5.797 


1.00 21.42 


A 


C 


ATOM 


67 


N 


ARG A 


7 


76.173 


53.126 


-6.364 


1.00 21.10 


A 


N 


ATOM 


68 


CA 


ARG A 


7 


77.333 


52.266 


-6.230 


1.00 23.84 


A 


c 


ATOM 


69 


C 


ARG A 


7 


77.237 


51.485 


-4.939 


1.00 25.78 


A 


c 


ATOM 


70 


0 


ARG A 


7 


76.424 


51.808 


-4.059 


1.00 21.54 


A 


o 


ATOM 


71 


CB 


ARG A 


7 


78.610 


53.103 


-6.226 


1.00 26.25 


A 


c 


ATOM 


72 


CG 


ARG A 


7 


78.992 


53.658 


-7.583 


1.00 30.55 


A 


c 


ATOM 


73 


CD 


ARG A 


7 


80.135 


54.652 


-7.549 


1.00 37.65 


A 


c 


ATOM 


74 


NE 


ARG A 


7 


80.063 


55,407 


-8.932 


0.00 40.50 


A 


N 
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ATOM 


75 


CZ 


ARG 


A 


7 


80.997 


56.306 


-9.222 


0.00 


41.92 


A 


C 


ATOM 


76 


NHl 


ARG 


A 


7 


80.991 


56.911 


-10.402 


0.00 


42 .93 


A 


N 


ATOM 


77 


NH2 


ARG 


A 


7 


81.937 


56.601 


-8.335 


0.00 


42 .80 


A 


N 


ATOM 


78 


N 


GLY 


A 


8 


78.091 


50.479 


-4.799 


1.00 


26,16 


A 


N 


ATOM 


79 


CA 


GLY 


A 


8 


78.086 


49.663 


-3.598 


1.00 


29.54 


A 


C 


ATOM 


80 


C 


GLY 


A 


8 


79.032 


48.490 


-3.639 


1.00 


31.18 


A 


C 


ATOM 


81 


O 


GLY 


A 


8 


79.790 


48.325 


-4.591 


1.00 


33.68 


A 


o 


ATOM 


82 


N 


LYS 


A 


9 


78.986 


47.685 


-2.587 


1.00 


34.88 


A 


N 


ATOM 


83 


CA 


LYS 


A 


9 


79.643 


46.390 


-2.578 


1.00 


36.27 


A 


C 


ATOM 


84 


C 


LYS 


A 


9 


78.625 


45.337 


-2.169 


1.00 


37.50 


A 


C 


ATOM 


85 


0 


LYS 


A 


9 


77.771 


45.576 


-1.316 


1.00 


32.87 


A 


o 


ATOM 


86 


CB 


LYS 


A 


9 


80.861 


46.396 


-1.649 


1.00 


39.66 


A 


c 


ATOM 


87 


CG 


LYS 


A 


9 


81.975 


47.324 


-2.120 


1.00 


45 .29 


A 


c 


ATOM 


88 


CD 


LYS 


A 


9 


83.346 


4 6.635 


-2.207 


1.00 


50 .21 


A 


c 


ATOM 


89 


CE 


LYS 


A 


9 


84.382 


47.543 


-2.887 


1.00 


52.01 


A 


c 


ATOM 


90 


NZ 


LYS 


A 


9 


85.408 


48.085 


-1.943 


1.00 


53.23 


A 


N 


ATOM 


91 


N 


SER 


A 


10 


78.708 


44,172 


-2.805 


1.00 


38. 65 


A 


N 


ATOM 


92 


CA 


SER 


A 


10 


77.807 


43.063 


-2,525 


1,00 


39.77 


A 


C 


ATOM 


93 


C 


SER 


A 


10 


77 . 658 


42.852 


-1.026 


1.00 


38.92 


A 


C 


ATOM 


94 


0 


SER 


A 


10 


78.658 


42.718 


-0.316 


1.00 


38.89 


A 


o 


ATOM 


95 


CB 


SER 


A 


10 


78.336 


41.776 


-3.172 


1.00 


41.88 


A 


c 


ATOM 


96 


OG 


SER 


A 


10 


77.485 


40.680 


-2.879 


1.00 


44.59 


A 


o 


ATOM 


97 


N 


GLY 


A 


11 


76.410 


42.857 


-0.556 


1.00 


36.41 


A 


N 


ATOM 


98 


CA 


GLY 


A 


11 


76.097 


42.627 


0,843 


1.00 


35,71 


A 


C 


ATOM 


99 


C 


GLY 


A 


11 


76.076 


43.859 


1.738 


1.00 


35.38 


A 


C 


ATOM 


100 


0 


GLY 


A 


11 


75.631 


43.757 


2.886 


1.00 


37.81 


A 


o 


ATOM 


101 


N 


GLN 


A 


12 


76.519 


45.005 


1.213 


1.00 


34.18 


A 


N 


ATOM 


102 


CA 


GLN 


A 


12 


76.732 


46.234 


1.999 


1.00 


35.64 


A 


C 


ATOM 


103 


C 


GLN 


A 


12 


75, 861 


47.409 


1.536 


1.00 


35.07 


A 


c 


ATOM 


104 


O 


GLN 


A 


12 


76, 148 


48.558 


1.881 


1.00 


36.40 


A 


o 


ATOM 


105 


CB 


GLN 


A 


12 


78.196 


46.693 


1.913 


1.00 


37.52 


A 


c 


ATC^ 


106 


CG 


GLN 


A 


12 


79.230 


45.703 


2.437 


1.00 


42.55 


A 


c 


ATC^d 


107 


CD 


GLN 


A 


12 


80. 653 


46.267 


2.465 


1.00 


40.98 


A 


c 


ATOM 


108 


OEl 


GLN 


A 


12 


81.562 


45. 623 


2.984 


1.00 


49.77 


A 


0 


ATOM 


109 


NE2 


GLN 


A 


12 


80.846 


47.450 


1.904 


1.00 


50.11 


A 


N 


ATOM 


110 


N 


GLY 


A 


13 


74.824 


47.132 


0.749 


1.00 


30.97 


A 


N 


ATOM 


111 


CA 


GLY 


A 


13 


73.887 


48 .163 


0.331 


1.00 


27.55 


A 


C 


ATOM 


112 


C 


GLY 


A 


13 


74.366 


49.021 


-0.820 


1.00 


25.65 


A 


c 


ATOM 


113 


0 


GLY 


A 


13 


75.491 


48 .904 


-1.289 


1.00 


26.10 


A 


o 


ATOM 


114 


N 


TYR 


A 


14 


73.477 


49.892 


-1.275 


1.00 


17.01 


A 


N 


ATOM 


115 


CA 


TYR 


A 


14 


73.738 


50.794 


-2.395 


1.00 


17.38 


A 


C 


ATOM 


116 


C 


TYR 


A 


14 


73.722 


52.218 


-1.880 


1.00 


16.80 


A 


c 


ATOM 


117 


o 


TYR 


A 


14 


72.851 


52.561 


-1.072 


1.00 


17.47 


A 


o 


ATOM 


118 


CB 


TYR 


A 


14 


72.635 


50.663 


-3.446 


1.00 


18.29 


A 


c 


ATOM 


119 


CG 


TYR 


A 


14 


72.651 


49.339 


-4.162 


1.00 


21.45 


A 


c 


ATOM 


120 


CDl 


TYR 


A 


14 


72.134 


48.194 


-3.574 


1.00 


20.72 


A 


c 


ATOM 


121 


CD2 


TYR 


A 


14 


73.201 


49.239 


-5.434 


1.00 


21.04 


A 


c 


ATOM 


122 


CEl 


TYR 


A 


14 


72.164 


46.981 


-4.246 


1.00 


20.87 


A 


c 


ATOM 


123 


CE2 


TYR 


A 


14 


73.233 


48.043 


-6.101 


1.00 


23.36 


A 


c 


ATOM 


124 


CZ 


TYR 


A 


14 


72.723 


46.935 


-5.522 


1.00 


24.50 


A 


c 


ATOM 


125 


OH 


TYR 


A 


14 


72.758 


45.757 


-6.229 


1.00 


27.32 


A 


Q 


ATOM 


126 


N 


TYR 


A 


15 


74.636 


53.044 


-2.387 


1.00 


18.15 


A 


N 


ATOM 


127 


CA 


TYR 


A 


15 


74.727 


54.431 


-1.976 


1.00 


15.54 


A 


c 


ATOM 


128 


C 


TYR 


A 


15 


74 .734 


55.415 


-3.133 


1.00 


16.89 


A 


c 


ATOM 


129 


o 


TYR 


A 


15 


75 .171 


55.108 


-4.243 


1.00 


17.87 


A 


o 


ATOM 


130 


CB 


TYR 


A 


15 


75.951 


54.666 


-1.064 


1.00 


16.46 


A 


c 


ATOM 


131 


CG 


TYR 


A 


15 


77 .308 


54 ,342 


-1.685 


1.00 


15.58 


A 


c 


ATOM 


132 


CDl 


TYR 


A 


15 


77 . 966 


55.246 


-2 .501 


1.00 


19. 48 


A 


c 


ATOM 


133 


CD2 


TYR 


A 


15 


77 . 919 


53.139 


-1.411 


1.00 


19. 60 


A 


c 


ATOM 


134 


CEl 


TYR 


A 


15 


79.201 


54 .956 


-3.034 


1.00 


21.95 


A 


c 


ATOM 


135 


CE2 


TYR 


A 


15 


79.165 


52.838 


-1.926 


1.00 


23.26 


A 


c 


ATOM 


136 


CZ 


TYR 


A 


15 


79.787 


53.734 


-2.739 


1.00 


21.80 


A 


c 


ATOM 


137 


OH 


TYR 


A 


15 


81.006 


53,396 


-3.255 


1.00 


26.17 


A 


0 


ATOM 


138 


N 


VAL 


A 


16 


74.279 


56. 620 


-2.823 


1.00 


17.50 


A 


N 


ATCM 


139 


CA 


VAL 


A 


16 


74.197 


57.728 


-3.760 


1.00 


19.34 


A 


c 


ATOM 


140 


C 


VAL 


A 


16 


75,077 


58.862 


-3.212 


1.00 


20.35 


A 


c 


ATOM 


141 


o 


VAL 


A 


16 


75.165 


59.056 


-1,995 


1.00 


20.27 


A 


o 


ATOM 


142 


CB 


VAL 


A 


16 


72 . 715 


58 . 201 


-3 . 936 


1.00 


18.58 


A 


c 


ATOM 


143 


CGI 


VAL 


A 


16 


72.177 


58.911 


-2.680 


1.00 


18.67 


A 


c 


ATOM 


144 


CG2 


VAL 


A 


16 


72.554 


59.101 


-5.172 


1.00 


21.03 


A 


c 


ATOM 


145 


N 


GLU 


A 


17 


75.715 


59.608 


-4.101 


1.00 


20.07 


A 


N 


ATOM 


146 


CA 


GLU 


A 


17 


76.401 


60.838 


-3.706 


1.00 


22.44 


A 


C 


ATOM 


147 


C 


GLU 


A 


17 


75.398 


61.943 


-3.372 


1.00 


22.83 


A 


C 


ATOM 


148 


0 


GLO 


A 


17 


74.419 


62.145 


-4,091 


1.00 


20.94 


A 


o 


ATOM 


149 


CB 


GLU 


A 


17 


77.360 


61.298 


-4.810 


1.00 


23.72 


A 


C 


ATOM 


150 


CG 


GLU 


A 


17 


78.246 


62.482 


-4.416 


1.00 


28.53 


A 


C 


ATOM 


151 


CD 


GLU 


A 


17 


79.065 


63.024 


-5.580 


1.00 


36.53 


A 


C 
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ATOM 


152 


OEl 


GLU A 


17 


78.956 


64.228 


-5.878 


1.00 


39.02 


A 


O 


ATOM 


153 


0E2 


GLO A 


17 


79.820 


62.249 


-6.201 


1.00 


41.99 


A 


O 


ATOM 


154 


N 


MET A 


18 


75.616 


62.632 


-2.249 


1.00 


18.64 


A 


N 


ATOM 


155 


CA 


MET A 


18 


74 .824 


63.788 


-1.849 


1.00 


18.78 


A 


C 


ATOM 


156 


C 


MET A 


18 


75.744 


64.904 


-1.365 


1.00 


24.24 


A 


C 


ATOM 


157 


O 


MET A 


18 


76.919 


64.671 


-1.079 


1.00 


23.12 


A 


O 


ATOM 


158 


CB 


MET A 


18 


73.866 


63.427 


-0.717 


1.00 


20.09 


A 


C 


ATOM 


159 


CG 


MET A 


18 


72.884 


62.284 


-1.064 


1.00 


17.91 


A 


C 


ATOM 


160 


SD 


MET A 


18 


71.685 


61.911 


0.240 


1.00 


20.92 


A 


s 


ATOM 


161 


CE 


MET A 


18 


70.491 


63.197 


-0.005 


1.00 


21.35 


A 


C 


ATOM 


162 


N 


THR A 


19 


75,229 


66.121 


-1.313 


1.00 


24.86 


A 


N 


ATOM 


163 


CA 


THR A 


19 


75.966 


67.206 


-0.661 


1.00 


26.57 


A 


C 


ATOM 


164 


C 


THR A 


19 


75.122 


67.794 


0.443 


1.00 


24.45 


A 


C 


ATOM 


165 


0 


THR A 


19 


73.904 


67.861 


0.341 


1.00 


23.60 


A 


O 


ATOM 


166 


CB 


THR A 


19 


76.392 


68.292 


-1.665 


1.00 


28.59 


A 


C 


ATOM 


167 


OGl 


THR A 


19 


75.236 


68.833 


-2.311 


1.00 


32.78 


A 


O 


ATOM 


168 


CG2 


THR A 


19 


77.235 


67.712 


-2.775 


1.00 


28.11 


A 


C 


ATOM 


169 


N 


VAL A 


20 


75.775 


68.213 


1.531 


1.00 


25.61 


A 


N 


ATOM 


170 


CA 


VAIi A 


20 


75,078 


68.836 


2.643 


1.00 


22.00 


A 


C 


ATOM 


171 


C 


VAIi A 


20 


75.826 


70.130 


2.995 


1.00 


21.90 


A 


C 


ATOM 


172 


O 


VAL A 


20 


77.040 


70.183 


2.841 


1.00 


23.44 


A 


O 


ATOM 


173 


CB 


VAL A 


20 


75.011 


67.902 


3.848 


1.00 


25.28 


A 


c 


ATOM 


174 


CGI 


VAL A 


20 


74.361 


68.579 


5.033 


1.00 


30.83 


A 


c 


ATOM 


175 


CG2 


VAL A 


20 


74.245 


66.611 


3.495 


1.00 


25.14 


A 


C 


ATOM 


176 


N 


GLY A 


21 


75.077 


71.146 


3.422 


1.00 


25.14 


A 


N 


ATOM 


177 


CA 


GLY A 


21 


75.623 


72.434 


3.837 


1.00 


27.79 


A 


C 


ATOM 


178 


C 


GLY A 


21 


76.015 


73.417 


2.752 


1.00 


26.88 


A 


C 


ATOM 


179 


O 


GLY A 


21 


75.906 


73.137 


1.551 


1.00 


27.40 


A 


0 


ATOM 


180 


N 


SER A 


22 


76.466 


74.594 


3.202 


1.00 


28.28 


A 


N 


ATOM 


181 


CA 


SER A 


22 


76.976 


75.657 


2.330 


1.00 


29.16 


A 


C 


ATOM 


182 


C 


SER A 


22 


78.298 


76.173 


2.919 


1.00 


28.62 


A 


C 


ATOM 


183 


0 


SER A 


22 


78.308 


76.639 


4.049 


1.00 


29.95 


A 


O 


ATOM 


184 


CB 


SER A 


22 


75.983 


76.815 


2.238 


1.00 


29.69 


A 


C 


ATOM 


185 


OG 


SER A 


22 


74.675 


76.366 


1.925 


1.00 


29.77 


A 


O 


ATOM 


186 


N 


PRO A 


23 


79.407 


76.052 


2.198 


1.00 


28.22 


A 


N 


ATOM 


187 


CA 


PRO A 


23 


79.461 


75.401 


0.884 


1.00 


30.78 


A 


C 


ATOM 


188 


C 


PRO A 


23 


79.227 


73.886 


0.976 


1.00 


29.87 


A 


C 


ATOM 


189 


0 


PRO A 


23 


79.338 


73.300 


2.063 


1.00 


25.45 


A 


o 


ATOM 


190 


CB 


PRO A 


23 


80.875 


75.693 


0.407 


1.00 


31.63 


A 


C 


ATOM 


191 


CG 


PRO A 


23 


81.664 


75.968 


1,651 


1.00 


29.94 


A 


C 


ATOM 


192 


CD 


PRO A 


23 


80.727 


76.545 


2.629 


1.00 


33.02 


A 


c 


ATOM 


193 


N 


PRO A 


24 


78.894 


73.258 


-0.145 


1.00 


30.31 


A 


N 


ATOM 


194 


CA 


PRO A 


24 


78.559 


71.821 


-0.139 


1.00 


26.63 


A 


c 


ATOM 


195 


C 


PRO A 


24 


79.673 


70.857 


0.304 


1.00 


25.38 


A 


c 


ATOM 


196 


0 


PRO A 


24 


80.807 


70.925 


-0.155 


1.00 


25.17 


A 


0 


ATOM 


197 


CB 


PRO A 


24 


78.141 


71.536 


-1.593 


1.00 


28.22 


A 


c 


ATOM 


198 


CG 


PRO A 


24 


78.576 


72.715 


-2.410 


1.00 


32.40 


A 


c 


ATOM 


199 


CD 


PRO A 


24 


78.778 


73.874 


-1.484 


1.00 


33.13 


A 


c 


ATOM 


200 


N 


GLN A 


25 


79.292 


69.920 


1.169 


1.00 


24.26 


A 


N 


ATOM 


201 


CA 


GLN A 


25 


80.144 


68.839 


1.620 


1.00 


23.05 


A 


c 


ATOM 


202 


C 


GLN A 


25 


79.617 


67.576 


0.992 


1.00 


19.90 


A 


c 


ATOM 


203 


O 


GLN A 


25 


78.470 


67.220 


1,220 


1.00 


20.87 


A 


O 


ATOM 


204 


CB 


GLN A 


25 


80.075 


68.728 


3.127 


1.00 


20.92 


A 


C 


ATOM 


205 


CG 


GLN A 


25 


80.581 


69.995 


3.817 


1.00 


25.92 


A 


C 


ATOM 


206 


CD 


GLN A 


25 


80.491 


69.911 


5.317 


1.00 


24.91 


A 


C 


ATOM 


207 


OEl 


GLN A 


25 


80.742 


68.850 


5.894 


1.00 


21.17 


A 


0 


ATOM 


208 


NE2 


GLN A 


25 


80.153 


71.021 


5.957 


1.00 


26.06 


A 


N 


ATOM 


209 


N 


THR A 


26 


80.439 


66.926 


0.187 


1.00 


23.72 


A 


N 


ATOM 


210 


CA 


THR A 


26 


80.041 


65.699 


-0.495 


1.00 


23.00 


A 


C 


ATOM 


211 


C 


THR A 


26 


80.141 


64.498 


0.435 


1.00 


22.59 


A 


C 


ATOM 


212 


O 


THR A 


26 


81.151 


64.310 


1.103 


1.00 


23.44 


A 


O 


ATOM 


213 


CB 


THR A 


26 


80.943 


65.456 


-1.685 


1.00 


24.91 


A 


C 


ATOM 


214 


OGl 


THR A 


26 


80.891 


66.588 


-2.566 


1.00 


31.54 


A 


O 


ATOM 


215 


CG2 


THR A 


26 


80.428 


64.292 


-2.537 


1.00 


25.28 


A 


C 


ATOM 


216 


N 


LEU A 


27 


79,107 


63.666 


0.430 


1.00 


19.15 


A 


N 


ATOM 


217 


CA 


LEU A 


27 


79.093 


62.431 


1.198 


1.00 


18.03 


A 


C 


ATOM 


218 


C 


LEU A 


27 


78.394 


61.329 


0.375 


1.00 


22.50 


A 


c 


ATOM 




0 


LEU A 


27 


77 . 511 


61 . 636 


-0.415 


1 .00 


25 .14 


A 


o 


ATOM 


220 


CB 


LEU A 


27 


78.310 


62.637 


2.488 


1.00 


18.41 


A 


C 


ATOM 


221 


CG 


LEU A 


27 


78.805 


63-740 


3.447 


1.00 


23.17 


A 


c 


ATOM 


222 


CDl 


LEU A 


27 


77.737 


64.155 


4.429 


1.00 


28.47 


A 


c 


ATOM 


223 


CD2 


LEU A 


27 


80.040 


63.300 


4.174 


1.00 


22.35 


A 


c 


ATOM 


224 


N 


ASN A 


28 


78.804 


60.075 


0.562 


1.00 


19.63 


A 


N 


ATOM 


225 


CA 


ASN A 


28 


78.097 


58.926 


-0.013 


1.00 


18.44 


A 


c 


ATOM 


226 


C 


ASN A 


28 


77.098 


58.404 


0.985 


1.00 


17.41 


A 


c 


ATC»! 


227 


0 


ASN A 


28 


77.467 


58.130 


2.122 


1.00 


15.99 


A 


O 


ATOM 


228 


CB 


ASN A 


28 


79.059 


57.817 


-0.346 


1.00 


17.43 


A 


c 
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ATOM 


229 


CG 


ASN 


A 


28 


79 . 868 


58 .114 


-1 .556 


1.00 


22 . 09 


A 


C 


ATOM 


230 


ODl 


ASN 


A 


28 


79. 407 


58 .837 


-2,434 


1.00 


21,00 


A 


O 


ATOM 


231 


ND2 


ASN 


A 


28 


81 . 084 


57 ,573 


-1.622 


1.00 


22 . 09 


A 


N 


ATOM 


232 


N 


ILE 


A 


29 


75 .848 


58 .222 


0,566 


1.00 


13.33 


A 


N 


ATOM 


233 


OA 


ILE 


A 


29 


74 .741 


57 . 964 


1.501 


1.00 


15.06 


A 


C 


ATOM 


234 


C 


ILE 


A 


29 


73 .969 


56.724 


1.072 


1.00 


15. 98 


A 


C 


ATOM 


235 


0 


ILE 


A 


29 


73 .495 


56. 628 


-0.071 


1.00 


16 , 00 


A 


O 


ATOM 


236 


CB 


ILE 


A 


29 


73 .777 


59.164 


1.569 


1.00 


17 , 19 


A 


C 


ATOM 


237 


CGI 


ILE 


A 


29 


74.533 


60.443 


1.960 


1.00 


16.84 


A 


C 


ATOM 


238 


CG2 


ILE 


A 


29 


72.625 


58.876 


2.579 


1.00 


15.77 


A 


C 


ATOM 


239 


GDI 


ILE 


A 


29 


75.147 


60.409 


3.359 


1.00 


18 .72 


A 


C 


ATOM 


240 


N 


LEU 


A 


30 


73.829 


55.787 


1.997 


1.00 


15.17 


A 


N 


ATOM 


241 


CA 


LEU 


A 


30 


73 . 110 


54.541 


1.743 


1.00 


16. 63 


A 


C 


ATOM 


242 


C 


LEU 


A 


30 


71 . 623 


54.825 


1.455 


1.00 


17.89 


A 


C 


ATOM 


243 


0 


LEU 


A 


30 


71 . 000 


55.542 


2.186 


1.00 


17,80 


A 


o 


ATOM 


244 


CB 


LEU 


A 


30 


73 .251 


53.629 


2.964 


1.00 


14.92 


A 


C 


ATOM 


245 


CG 


LEU 


A 


30 


72 . 441 


52,335 


2.947 


1.00 


24 .85 


A 


C 


ATOM 


246 


GDI 


LEO 


A 


30 


73 . 456 


51.336 


1 .962 


0.00 


19.90 


A 


c 


ATOM 


247 


CD2 


LEU 


A 


30 


72 .418 


51.625 


4,210 


0.00 


19.96 


A 


C 


ATOM 


248 


N 


VAL 


A 


31 


71 , 059 


54 .224 


0.405 


1.00 


15.67 


A 


N 


ATOM 


249 


CA 


VAL 


A 


31 


69. 656 


54 .390 


0.066 


1.00 


17.96 


A 


C 


ATOM 


250 


C 


VAL 


A 


31 


68 . 865 


53.269 


0.715 


1.00 


18.65 


A 


C 


ATOM 


251 


0 


VAL 


A 


31 


69 . 101 


52.060 


0.440 


1.00 


21.01 


A 


O 


ATOM 


252 


CB 


VAL 


A 


31 


69 .461 


54 .358 


-1.471 


1.00 


21.10 


A 


C 


ATOM 


253 


CGI 


VAL 


A 


31 


67 . 991 


54 .309 


-1,806 


1.00 


23,22 


A 


C 


ATOM 


254 


CG2 


VAL 


A 


31 


70 . 102 


55.554 


-2.073 


1.00 


19, 69 


A 


c 


ATOM 


255 


N 


ASP 


A 


32 


67 . 936 


53 .656 


1,591 


1.00 


18.25 


A 


N 


ATOM 


256 


CA 


ASP 


A 


32 


67 .221 


52 .712 


2.456 


1.00 


20.14 


A 


C 


ATOM 


257 


C 


ASP 


A 


32 


65 .712 


52 . 942 


2. 457 


1.00 


18 .89 


A 


C 


ATOM 


258 


0 


ASP 


A 


32 


65 .217 


53 . 839 


3. 144 


1.00 


18.73 


A 


0 


ATOM 


259 


CB 


ASP 


A 


32 


67 .748 


52 .832 


3.905 


1.00 


20.81 


A 


C 


ATOM 


260 


CG 


ASP 


A 


32 


67 . 163 


51.747 


4.850 


1.00 


27.29 


A 


c 


ATOM 


261 


ODl 


ASP 


A 


32 


66. 652 


50.729 


4.345 


1.00 


28.02 


A 


0 


ATOM 


262 


0D2 


ASP 


A 


32 


67 . 178 


51.817 


6.113 


1.00 


29.94 


A 


o 


ATOM 


263 


N 


THR 


A 


33 


64 . 947 


52 . 108 


1.735 


1,00 


15.71 


A 


N 


ATOM 


264 


CA 


THR 


A 


33 


63 . 5O0 


52 .284 


1.753 


1.00 


16.65 


A 


C 


ATOM 


265 


C 


THR 


A 


33 


62 . 839 


51 . 643 


2. 958 


1.00 


18,62 


A 


C 


ATOM 


266 


0 


THR 


A 


33 


61. 627 


51.707 


3.086 


1.00 


19,27 


A 


O 


ATOM 


267 


CB 


THR 


A 


33 


62 ,855 


51.726 


0.459 


1. 00 


17.78 


A 


C 


ATOM 


268 


OGl 


THR 


A 


33 


63 . 088 


50.330 


0.395 


1.00 


17.76 


A 


O 


ATOM 


269 


CG2 


THR 


A 


33 


63.526 


52.289 


-0.756 


1.00 


20.47 


A 


C 


ATOM 


270 




GLY 


A 


34 


63.645 


51.078 


3.854 


1.00 


19.46 


A 


N 


ATOM 


271 


CA 


GLY 


A 


34 


63.137 


50.457 


5.065 


1.00 


22.82 


A 


C 


ATOM 


272 


C 


GLY 


A 


34 


63.251 


51.314 


6.315 


1.00 


24.98 


A 


C 


ATOM 


273 


0 


GLY 


A 


34 


63.033 


50.830 


7.434 


1.00 


24.60 


A 


O 


ATOM 


274 


N 


SER 


A 


35 


63. 601 


52.578 


6,130 


1.00 


18 .89 


A 


N 


ATOM 


275 


CA 


SER 


A 


35 


63 . 672 


53 . 543 


7.231 


1.00 


21 .21 


A 


c 


ATOM 


276 


C 


SER 


A 


35 


63 . 376 


54 . 978 


6.749 


1,00 


18.57 


A 


c 


ATOM 


277 


0 


SER 


A 


35 


63 .245 


55.229 


5.535 


1.00 


21.32 


A 


o 


ATOM 


278 


CB 


SER 


A 


35 


65 . 045 


53. 420 


7.880 


1,00 


21. 69 


A 


c 


ATOM 


279 


OG 


SER 


A 


35 


66.063 


53.982 


7,078 


1.00 


20 .28 


A 


o 


ATOM 


280 


N 


SER 


A 


36 


63.253 


55.940 


7.678 


1.00 


18 .30 


A 


N 


ATOM 


281 


CA 


SER A 


36 


62 .727 


57,267 


7.347 


1.00 


20.36 


A 


c 


ATOM 


282 


C 


SER 


A 


36 


63 .545 


58.455 


7.889 


1.00 


21*41 


A 


0 


ATOM 


283 


O 


SER 


A 


36 


63.101 


59.594 


7.809 


1.00 


19.92 


A 


O 


ATOM 


284 


CB 


SER 


A 


36 


61 .267 


57.375 


7.824 


1.00 


25.82 


A 


C 


ATOM 


285 


OG 


SER 


A 


36 


60. 485 


56.344 


7.230 


1,00 


25.30 


A 


O 


ATOM 


286 


N 


ASN 


A 


37 


64 .748 


58 .181 


8.396 


1.00 


19.59 


A 


N 


ATOM 


287 


CA 


ASN 


A 


37 


65 , 676 


59.222 


8.853 


1.00 


20.44 


A 


c 


ATOM 


288 


C 


ASN 


A 


37 


66 .852 


59.444 


7.907 


1. 00 


17 .40 


A 


C 


ATOM 


289 


0 


ASN 


A 


37 


67 . 426 


58 .484 


7.386 


1.00 


17 . 40 


A 


0 


ATOM 


290 


CB 


ASN 


A 


37 


66.262 


58 .847 


10,225 


1,00 


19 .75 


A 


c 


ATOM 


291 


CG 


ASN 


A 


37 


65 .330 


59,162 


11.365 


1. 00 


25 . 09 


A 


c 


ATOM 


292 


ODl 


ASN 


A 


37 


65 .323 


60 .288 


11.888 


1.00 


26.01 


A 


o 


ATOM 


293 


ND2 


ASN 


A 


37 


64.555 


58 .177 


11.776 


1.00 


21 . 61 


A 


N 


ATOM 


294 


H 


PHE 


A 


38 


67 .217 


60.704 


7 . 697 


1.00 


18 . 60 


A 


N 


ATOM 


295 


CA 


PHB 


A 


38 


68 . 450 


61.064 


7.013 


1.00 


17 .76 


A 


C 


ATOTl 


296 


C 


PHE 


A 


38 


69 . 494 


61.330 


8.089 


1 .00 


17 . 46 


A 


c 


ATOM 


297 


0 


PHE 


A 


38 


69.356 


62.288 


8.837 


1.00 


18.26 


A 


o 


ATOM 


298 


CB 


PHE 


A 


38 


68.236 


62.307 


6.143 


1.00 


17.46 


A 


c 


ATOM 


299 


CG 


PHE 


A 


38 


69.466 


62,776 


5.366 


1.00 


18.60 


A 


c 


ATOM 


300 


CDl 


PHE 


A 


38 


70.391 


61.896 


4.828 


1.00 


17,37 


A 


c 


ATOM 


301 


CD2 


PHE 


A 


38 


69.657 


64.124 


5.127 


1.00 


24.93 


A 


c 


ATOM 


302 


CEl 


PHE 


A 


38 


71.488 


62.350 


4.104 


1.00 


19,65 


A 


c 


ATOM 


303 


CE2 


PHE 


A 


38 


70.747 


64.586 


4.384 


1,00 


19,49 


A 


c 


ATOM 


304 


CZ 


PHE 


A 


38 


71.669 


63.701 


3,881 


1.00 


23.24 


A 


c 


ATOM 


305 


N 


ALA 


A 


39 


70.467 


60.430 


8.224 


1.00 


18.71 


A 


N 
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ATOM 


306 


CA 


ALA 


A 


39 


71 . 480 


60.508 


9 .272 


1 . 00 


18 .80 


A 


C 


ATOM 


307 


C 


ALA 


A 


39 


72.866 


60.348 


8 . 667 


1 . 00 


20 . 90 


A 


c 


ATOM 


308 


O 


ALA 


A 


39 


73. 104 


59.439 


7 .862 


1 . 00 


20 . 32 


A 


o 


ATOM 


309 


CB 


ALA 


A 


39 


71 .225 


59.457 


10.334 


1 .00 


17 . 93 


A 


c 


ATOM 


310 


N 


VAL 


A 


40 


73.792 


61.223 


9.058 


1 . 00 


19 .20 


A 


N 


ATOM 


311 


CA 


VAL 


A 


40 


75. 145 


61. 189 


8.526 


1.00 


18 .03 


A 


C 


ATOM 


312 


C 


VAL 


A 


40 


76.193 


61.242 


9. 640 


1 . 00 


18 .42 


A 


C 


ATOM 


313 


0 


VAL 


A 


40 


76.027 


61.985 


10.580 


1.00 


15 .83 


A 


0 


ATOM 


314 


CB 


VAL 


A 


40 


75.398 


62.372 


7.587 


1.00 


19.32 


A 


C 


ATOM 


315 


CGI 


VAL 


A 


40 


74.430 


62.354 


6.382 


1.00 


24.72 


A 


C 


ATOM 


316 


CG2 


VAL 


A 


40 


75 . 304 


63.711 


8 .319 


1 . 00 


25 .33 


A 


C 


ATOM 


317 


N 


GLY 


A 


41 


77 .272 


60.490 


9.488 


1 , 00 


18 ,41 


A 


N 


ATOM 


318 


CA 


GLY 


A 


41 


78 . 444 


60 . 626 


10 .354 


1 . 00 


13 . 03 


A 


C 


ATOM 


319 


C 


GLY 


A 


41 


78 . 921 


62 . 049 


10.4 63 


1 . 00 


16.57 


A 


C 


ATOM 


320 


O 


GLY 


A 


41 


78 . 986 


62.780 


9 . 486 


1 . 00 


16.35 


A 


0 


ATOM 


321 


N 


ALA 


A 


42 


79 . 186 


62 . 482 


11 . 688 


1 . 00 


18 . 46 


A 


N 


ATOM 


322 


CA 


ALA 


A 


42 


79 . 513 


63 .880 


11 . 952 


1 . 00 


16.09 


A 


C 


ATOM 


323 


C 


ALA 


A 


42 


80 . 745 


63 . 987 


12 .843 


1 . 00 


21 - 94 


A 


C 


ATOM 


324 


O 


ALA 


A 


42 


81 . 068 


65 .059 


13 .334 


1 . 00 


21 • 99 


A 


O 


ATOM 


325 


CB 


ALA 


A 


42 


78 . 326 


64 . 558 


12 . 613 


1.00 


19.21 


A 


C 


ATOM 


326 


N 


ALA 


A 


43 


81. 444 


62.873 


12 .985 


1.00 


17 .43 


A 


N 


ATOM 


327 


CA 


ALA 


A 


43 


82. 584 


62.752 


13 .899 


1.00 


19.03 


A 


C 


ATOM 


328 


C 


ALA 


A 


43 


83.590 


61.822 


13.222 


1.00 


22.11 


A 


C 


ATOM 


329 


O 


ALA 


A 


43 


83 . 186 


60 . 977 


12 . 414 


1. 00 


18 . 84 


A 


O 


ATOM 


330 


CB 


ALA 


A 


43 


82 . 131 


62.185 


15.216 


1. 00 


20.66 


A 


c 


ATOM 


331 


N 


PRO 


A 


44 


84 .880 


61 .964 


13.530 


1 . 00 


21 .75 


A 


N 


ATOM 


332 


CA 


PRO 


A 


44 


85 , 928 


61 . 128 


12.903 


1 . 00 


22 . 99 


A 


c 


ATOM 


333 


C 


PRO 


A 


44 


86. 039 


59. 692 


13.422 


1 . 00 


21 .03 


A 


c 


ATOM 


334 


0 


PRO 


A 


44 


87 .044 


59 .283 


13.989 


1. 00 


22 .42 


A 


0 


ATOM 


335 


CB 


PRO 


A 


44 


87 .204 


61.930 


13.173 


1.00 


23 . 97 


A 


c 


ATOM 


336 


CG 


PRO 


A 


44 


86. 923 


62 . 655 


14 .467 


1.00 


21 .28 


A 


c 


ATOM 


337 


CD 


PRO 


A 


44 


85 .466 


63.000 


14 .406 


1.00 


22 . 65 


A 


c 


ATOM 


338 


N 


HIS 


A 


45 


85.004 


58.904 


13.175 


1.00 


19.15 


A 


N 


ATOM 


339 


CA 


HIS 


A 


45 


85 .011 


57.491 


13.493 


1.00 


19.87 


A 


c 


ATOM 


340 


C 


HIS 


A 


45 


86.074 


56.884 


12.559 


1 . 00 


23.49 


A 


c 


ATOM 


341 


0 


HIS 


A 


45 


86. 161 


57 .279 


11.408 


1 . 00 


18 . 76 


A 


0 


ATOM 


342 


CB 


HIS 


A 


45 


83 . 600 


56.898 


13 .231 


1 . 00 


20 ,18 


A 


c 


ATOM 


343 


CG 


HIS 


A 


45 


83 .499 


55 ,426 


13 ,491 


1.00 


20.56 


A 


c 


ATOM 


344 


NDl 


HIS 


A 


45 


82 . 921 


54 . 900 


14 . 628 


1 . 00 


27 .21 


A 


N 


ATOM 


345 


CD2 


HIS 


A 


45 


83 . 911 


54 .369 


12 ,753 


1 . 00 


20 .97 


A 


c 


ATOM 


346 


CEl 


HIS 


A 


45 


82 . 989 


53.579 


14 .577 


1.00 


20.15 


A 


c 




347 


NB2 


HIS 


A 


45 


83 .572 


53.234 


13.443 


1.00 


26.79 


A 


N 


ATOtl 


348 


N 


PRO 


A 


46 


86. 900 


55.958 


13 .039 


1.00 


23.59 


A 


N 


ATCX4 


349 


CA 


PRO 


A 


46 


87 , 999 


55.418 


12 .221 


1.00 


26.27 


A 


c 


ATOM 


350 


C 


PRO 


A 


46 


87 . 618 


54 ,722 


10.881 


1.00 


23.39 


A 


c 


ATOM 


351 


0 


PRO 


A 


46 


88 . 449 


54 . 679 


9 .975 


1. 00 


27 .08 


A 


o 


ATOM 


352 


CB 


PRO 


A 


46 


88 . 677 


54 .416 


13 .175 


1, 00 


24 .42 


A 


c 


ATOM 


353 


CG 


PRO 


A 


46 


87 . 621 


54 .034 


14 .147 


1. 00 


27 .39 


A 


c 


ATOM 


354 


CD 


PRO 


A 


46 


86. 863 


55.335 


14.378 


1.00 


25.05 


A 


c 


ATOM 


355 


N 


PHE 


A 


47 


86. 410 


54 .192 


10 .783 


1.00 


25.26 


A 


N 


ATOM 


356 


CA 


PHE 


A 


47 


85 . 924 


53 .538 


9.560 


1. 00 


25.03 


A 


C 


ATOM 


357 


C 


PHE 


A 


47 


85.523 


54.517 


8.446 


1. 00 


22.84 


A 


c 


ATOM 


358 


0 


PHE 


A 


47 


85.309 


54.084 


7.325 


1.00 


25.36 


A 


o 


ATOM 


359 


CB 


PHE 


A 


47 


84 . 678 


52.671 


9.832 


1.00 


27.84 


A 


c 


ATOM 


360 


CG 


PHE 


A 


47 


84 . 888 


51.503 


10.769 


1.00 


32 .30 


A 


c 


AT^ 


361 


CDl 


PHE 


A 


47 


86. 141 


51.176 


11.282 


1.00 


36.05 


A 


c 


ATOM 


362 


CD2 


PHE 


A 


47 


83 . 794 


50 .722 


11 .134 


1 . 00 


35 . 59 


A 


c 


AT^ 


363 


CEl 


PHE 


A 


47 


86 . 297 


50 ,098 


12 . 133 


1 . 00 


32 .80 


A 


c 


ATOM 


364 


CE2 


PHE 


A 


47 


83.945 


49 . 635 


12 .004 


1 . 00 


36 . 20 


A 


c 


ATOM 


365 


CZ 


PHE 


A 


47 


85 . 197 


49 . 326 


12 .489 


1 . 00 


37 .31 


A 


c 


ATC^ 


366 


N 


LEU 


A 


48 


85 . 377 


55 .804 


8 .761 


1 . 00 


19 . 13 


A 


N 


ATOM 


367 


CA 


LEU 


A 


48 


84 . 818 


56.789 


7 ,835 


1 . 00 


18.71 


A 


C 


ATOM 


368 


C 


LEU 


A 


48 


85.829 


57 .499 


6. 963 


1. 00 


22 .04 


A 


c 


ATOM 


369 


0 


LEU 


A 


48 


86.798 


58 .086 


7 .451 


1.00 


22.43 


A 


0 


ATOM 


370 


CB 


LEU 


A 


48 


84 .019 


57 .848 


6 . 602 


1.00 


17.69 


A 


c 


AT^ 


371 


CG 


LEU 


A 


48 


82 .797 


57.361 


9.367 


1.00 


14 .97 


A 


c 


ATOM 


372 


CDl 


LEU 


A 


48 


82 .068 


58 .567 


9.926 


1.00 


18.29 


A 


c 


AT^ 


373 


CD2 


LEU 


A 


48 


81 . 839 


56.567 


8.517 


1 . 00 


19.80 


A 


c 


ATOM 


374 


N 


HIS 


A 


49 


85.553 


57.517 


5.666 


1.00 


19.90 


A 


N 


ATOM 


375 


CA 


HIS 


A 


49 


86.310 


58.348 


4.715 


1.00 


23.16 


A 


C 


ATOM 


376 


C 


HIS 


A 


49 


86.115 


59.862 


4.903 


1.00 


23.74 


A 


C 


ATOM 


377 


O 


HIS 


A 


49 


87.033 


60.658 


4.676 


1.00 


24,96 


A 


0 


ATOM 


378 


CB 


HIS 


A 


49 


85.901 


58.027 


3.277 


1.00 


24.78 


A 


C 


ATOM 


379 


CG 


HIS 


A 


49 


86.253 


56.648 


2.822 


1.00 


18.81 


A 


c 


ATOM 


380 


NDl 


HIS 


A 


49 


87.368 


56.386 


2.054 


1.00 


23.64 


A 


N 


ATOM 


381 


CD2 


HIS 


A 


49 


85.623 


55,463 


2.989 


1.00 


17.53 


A 


c 


ATOM 


382 


CEl 


HIS 


A 


49 


87.408 


55.095 


1.779 


1.00 


20.49 


A 


c 
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ATOM 


383 


NE2 


HIS 


A 


49 


86.361 


54.512 


2.331 


1.00 


25.00 


A 


N 


ATOM 


384 


N 


ARG 


A 


50 


84.900 


60.274 


5.255 


1.00 


23.13 


A 


N 


ATOM 


385 


CA 


ARG 


A 


50 


84.603 


61.682 


5.496 


1.00 


24 .92 


A 


C 


ATOM 


386 


C 


ARG 


A 


50 


83.387 


61.768 


6, 


.398 


1.00 


22.50 


A 


C 


ATOM 


387 


O 


ARG 


A 


50 


82.761 


60.763 


6, 


.692 


1.00 


20. 11 


A 


0 


ATOM 


388 


CB 


ARG 


A 


50 


84.335 


62.435 


4, 


.200 


1.00 


31,00 


A 


c 


ATOM 


389 


CG 


ARG 


A 


50 


84.028 


61.549 


3. 


.065 


1.00 


30.52 


A 


c 


ATOM 


390 


CD 


ARG 


A 


50 


83.871 


62.231 


1, 


.758 


1.00 


33.45 


A 


c 


ATOM 


391 


NE 


ARG 


A 


50 


83.103 


61.374 


0.862 


1.00 


35.30 


A 


N 


ATOM 


392 


CZ 


ARG 


A 


50 


82.912 


61.613 


-0. 


.430 


1.00 


41.98 


A 


c 


ATOM 


393 


NHl 


ARG 


A 


50 


83.440 


62.692 


-1. 


.000 


1.00 


41.62 


A 


N 


ATOM 


394 


NH2 


ARG 


A 


50 


82.188 


60.765 


-1.159 


1.00 


41.63 


A 


N 


ATOM 


395 


N 


TYR 


A 


51 


83.097 


62.978 


6, 


,868 


1.00 


19.69 


A 


N 


ATOM 


396 


CA 


TYR 


A 


51 


81.968 


63.193 


7 . 


.727 


1.00 


19.01 


A 


C 


ATOM 


397 


C 


TYR 


A 


51 


81,513 


64.641 


7, 


.644 


1.00 


17.45 


A 


C 


ATOM 


398 


0 


TYR 


A 


51 


82.257 


65.509 


7. 


,198 


1.00 


19.82 


A 


O 


ATOM 


399 


CB 


TYR 


A 


51 


82.305 


62.792 


9. 


.175 


1.00 


17.00 


A 


C 


ATOM 


400 


CG 


TYR 


A 


51 


83.594 


63.414 


9. 


.694 


1.00 


19.81 


A 


C 


ATOM 


401 


CDl 


TYR 


A 


51 


84.807 


62.799 


9. 


.494 


1.00 


22.49 


A 


C 


ATOM 


402 


CD2 


TYR 


A 


51 


83.574 


64.625 


10, 


.391 


1.00 


27.51 


A 


C 


ATOM 


403 


CEl 


TYR 


A 


51 


85.996 


63.363 


9. 


.962 


1.00 


29.01 


A 


C 


ATOM 


404 


CE2 


TYR 


A 


51 


84.755 


65.198 


10, 


.853 


1.00 


22.34 


A 


C 


ATOM 


405 


CZ 


TYR 


A 


51 


85.959 


64.561 


10. 


.639 


1.00 


26.38 


A 


C 


ATOM 


406 


OH 


TYR 


A 


51 


87.153 


65.103 


11. 


.102 


1.00 


27.75 


A 


0 


ATOM 


407 


N 


TYR 


A 


52 


80.267 


64 .861 


8. 


.039 


1.00 


16.76 


A 


N 


ATOM 


408 


CA 


TYR 


A 


52 


79.630 


66.167 


8. 


.044 


1.00 


15. 41 


A 


C 


ATOM 


409 


C 


TYR 


A 


52 


80.251 


67.057 


9. 


.094 


1.00 


18.19 


A 


C 


ATOM 


410 


O 


TYR 


A 


52 


80.252 


66.703 


10. 


,268 


1,00 


18.86 


A 


o 


ATOM 


411 


CB 


TYR 


A 


52 


78.163 


65.968 


8. 


,360 


1 .00 


16.96 


A 


c 


ATOM 


412 


CG 


TYR 


A 


52 


77.241 


67.158 


8. 


,365 


1.00 


17.78 


A 


c 


ATOM 


413 


CDl 


TYR 


A 


52 


77.491 


68.311 


7, 


,617 


1.00 


19.54 


A 


c 


ATOM 


414 


CD2 


TYR 


A 


52 


76.057 


67.095 


9. 


,075 


1.00 


20.48 


A 


c 


ATOM 


415 


CEl 


TYR 


A 


52 


76.608 


69.378 


7. 


,664 


1.00 


17.41 


A 


c 


ATOM 


416 


CE2 


TYR 


A 


52 


75.160 


68.137 


9. 


,089 


1.00 


21.75 


A 


c 


ATOM 


417 


CZ 


TYR 


A 


52 


75.443 


69.280 


8. 


,373 


1.00 


20.07 


A 


c 


ATOM 


418 


OH 


TYR 


A 


52 


74.507 


70.291 


8. 


424 


1.00 


24.27 


A 


0 


ATOM 


419 


N 


GLN 


A 


53 


80.748 


68.214 


8. 


671 


1.00 


21.06 


A 


N 


ATOM 


420 


CA 


GLN 


A 


53 


81.372 


69.186 


9, 


580 


1.00 


22.83 


A 


c 


ATOM 


421 


C 


GLN 


A 


53 


80.474 


70.420 


9, 


662 


1.00 


18.33 


A 


c 


ATOM 


422 


o 


GLN 


A 


53 


80.601 


71.340 


8. 


878 


1.00 


23.76 


A 


o 


ATOM 


423 


CB 


GLN 


A 


53 


82.779 


69.535 


9. 


079 


1.00 


22.30 


A 


c 


ATOM 


424 


CG 


GLN 


A 


53 


83.750 


68.353 


9. 


108 


1.00 


24.84 


A 


c 


ATOM 


425 


CD 


GLN 


A 


53 


85.187 


68.690 


8.695 


1.00 


31.20 


A 


c 


ATOM 


426 


OEl 


GLN 


A 


53 


85.490 


68.915 


7, 


504 


1.00 


32.31 


A ■ 


o 


ATOM 


427 


ME2 


GLN 


A 


53 


86.080 


68.696 


9. 


671 


1.00 


27.07 


A 


N 


ATOM 


428 


N 


ARG 


A 


54 


79.537 


70.385 


10. 


597 


1.00 


20.86 


A 


N 


ATOM 


429 


CA 


ARG 


A 


54 


78.545 


71.442 


10. 


758 


1.00 


21.52 


A 


c 


ATOM 


430 


C 


ARG 


A 


54 


79.164 


72.827 


10, 


939 


1.00 


25.15 


A 


c 


ATOM 


431 


0 


ARG 


A 


54 


78.568 


73.828 


10. 


536 


1.00 


26.20 


A 


0 


ATOM 


432 


CB 


ARG 


A 


54 


77.629 


71.138 


11. 


918 


1.00 


21.46 


A 


C 


ATOM 


433 


CG 


ARG 


A 


54 


76.652 


69.995 


11. 


655 


1.00 


22.36 


A 


C 


ATOM 


434 


CD 


ARG 


A 


54 


75.989 


69.437 


12.869 


1.00 


24.51 


A 


C 


ATOM 


435 


NE 


ARG 


A 


54 


76.919 


68.779 


13. 


780 


1.00 


20.24 


A 


N 


ATOM 


436 


CZ 


ARG 


A 


54 


76.609 


68.376 


14.997 


1.00 


23.34 


A 


C 


ATOM 


437 


NHl 


ARG 


A 


54 


75.389 


68.574 


15. 


485 


1.00 


26.99 


A 


N 


ATOM 


438 


NH2 


ARG 


A 


54 


77.534 


67.786 


15. 


739 


1.00 


21.22 


A 


N 


ATOM 


439 


N 


GLN 


A 


55 


80.362 


72.880 


11. 


523 


1.00 


25.18 


A 


N 


ATOM 


440 


CA 


GLN 


A 


55 


81.055 


74.153 


11. 


741 


1.00 


25.49 


A 


C 


ATOM 


441 


C 


GLN 


A 


55 


81.403 


74.886 


10. 


453 


1.00 


27.22 


A 


c 


ATOM 


442 


O 


GLN 


A 


55 


81.623 


76,106 


10. 


471 


1,00 


31.96 


A 


o 


ATOM 


443 


CB 


GLN 


A 


55 


82.342 


73.951 


12, 


586 


1.00 


25.44 


A 


c 


ATOM 


444 


CG 


GLN 


A 


55 


83.508 


73.285 


11. 


866 


1.00 


26.87 


A 


c 


ATOM 


445 


CD 


GLN 


A 


55 


83.607 


71.787 


12.100 


1.00 


22.47 


A 


c 


ATOM 


446 


OEl 


GLN 


A 


55 


84.649 


71.186 


11. 


858 


1.00 


28.14 


A 


o 


ATOM 


447 


NE2 


GLN 


A 


55 


82.531 


71.192 


12. 


526 


1.00 


19.06 


A 


N 


ATOM 


448 


N 


LEO 


A 


56 


81.478 


74.148 


9. 


347 


1.00 


26.29 


A 


N 


ATOM 


449 


CA 


LEU 


A 


56 


81.846 


74.711 


8.055 


1.00 


26.09 


A 


c 


ATOM 


450 


C 


LEU 


A 


56 


80 . 646 


75 .193 


7. 


224 


1 . 00 


28 .01 


A 


c 


liTm 


451 


o 


LEO 


A 


56 


80.835 


75.716 


6. 


131 


1.00 


30.64 


A 


o 


ATOM 


452 


CB 


LEU 


A 


56 


82.667 


73.703 


7. 


251 


1.00 


28.42 


A 


c 


ATOM 


453 


CG 


LEU 


A 


56 


83,966 


73,147 


7. 


849 


1.00 


29.81 


A 


c 


ATOM 


454 


CDl 


LEU 


A 


56 


84.685 


72.309 


6. 


814 


1.00 


33.56 


A 


c 


ATOM 


455 


CD2 


LEU 


A 


56 


84.896 


74.243 


8. 


364 


1.00 


28.02 


A 


c 


ATOM 


456 


N 


SER 


A 


57 


79.432 


75.055 


7. 


760 


1.00 


27.95 


A 


N 


ATOM 


457 


CA 


SER 


A 


57 


78.199 


75.322 


7.009 


1.00 


27.26 


A 


c 


ATOM 


458 


C 


SER 


A 


57 


77.432 


76.528 


7. 


548 


1.00 


26.45 


A 


c 


ATOM 


459 


O 


SER 


A 


57 


76.970 


76.523 


8.701 


1.00 


27,40 


A 


o 
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ATOM 


460 


CB 


SER 


A 


57 


77.287 


74.086 


7 .037 


1 .00 


27 . 30 


A 


C 


ATOM 


461 


OG 


SER 


A 


57 


76.004 


74.353 


6.482 


1 .00 


24 . 82 


A 


0 


ATOM 


462 


N 


SER 


A 


58 


77.250 


77.541 


6.704 


1 . 00 


31.30 


A 


N 


ATOM 


463 


CA 


SER 


A 


58 


76.540 


78.753 


7 . 112 


1 . 00 


33 . 18 


A 


C 


ATOM 


464 


C 


SER 


A 


58 


75.049 


78 . 502 


7 .294 


1 . 00 


33.96 


A 


C 


ATOM 


465 


O 


SER 


A 


58 


74 .367 


79. 198 


8 . 059 


1 . 00 


31 . 39 


A 


0 


ATOM 


466 


CB 


SER 


A 


58 


76.761 


79. 879 


6. 097 


1 .00 


35 . 14 


A 


c 


ATOM 


467 


OG 


SER 


A 


58 


76.449 


79.481 


4 .769 


1 .00 


35, 98 


A 


0 


ATOM 


468 


N 


THR 


A 


59 


74.552 


77.473 


6. 608 


1 .00 


31 . 44 


A 


N 


ATOM 


469 


CA 


THR 


A 


59 


73.128 


77.222 


6. 528 


1 .00 


28 . 82 


A 


c 


ATOM 


470 


C 


THR 


A 


59 


72.637 


76.209 


7. 545 


1 .00 


27 .75 


A 


C 


ATOM 


471 


0 


THR 


A 


59 


71.431 


75.989 


7 . 648 


1 . 00 


26.38 


A 


0 


ATOM 


472 


CB 


THR 


A 


59 


72.745 


76.825 


5 . 079 


1 . 00 


30 . 74 


A 


C 


ATOM 


473 


OGl 


THR 


A 


59 


73.712 


75.937 


4 . 512 


1 . 00 


26.79 


A 


O 


ATOM 


474 


CG2 


THR 


A 


59 


72 .851 


78 .040 


4 . 175 


1 . 00 


31 . 50 


A 


c 


ATOM 


475 


N 


TYR 


A 


60 


73.559 


75.630 


8 .325 


1 .00 


25 .76 


A 


N 


ATOM 


476 


CA 


TYR 


A 


60 


73.204 


74 .716 


9 . 405 


1 . 00 


27 . 01 


A 


C 


ATOM 


477 


C 


TYR 


A 


60 


72.359 


75.391 


10 . 487 


1 . 00 


30 . 17 


A 


c 


ATOM 


478 


O 


TYR 


A 


60 


72 . 671 


76,504 


10.908 


1 .00 


32.85 


A 


0 


ATOM 


479 


CB 


TYR 


A 


60 


74 .475 


74.108 


10.024 


1 . 00 


29. 24 


A 


c 


ATOM 


480 


CG 


TYR 


A 


60 


74 .208 


73,401 


11.319 


1 . 00 


32 . 58 


A 


c 


ATOM 


481 


CDl 


TYR 


A 


60 


73 . 616 


72 .137 


11.341 


1 . 00 


33 . 45 


A 


c 


ATOM 


482 


C02 


TYR 


A 


60 


74 .507 


74 .016 


12 .539 


1 .00 


35 .22 


A 


c 


ATOM 


483 


CEl 


TYR 


A 


60 


73.344 


71 .495 


12 .545 


1 . 00 


34 . 91 


A 


c 


ATOM 


484 


CE2 


TYR 


A 


60 


74 .242 


73.384 


13 .741 


1 . 00 


35 . 99 


A 


c 


ATOM 


485 


CZ 


TYR 


A 


60 


73 . 661 


72 . 128 


13 .739 


1 . 00 


36.24 


A 


c 


ATOM 


486 


OH 


TYR 


A 


60 


73 . 406 


71.510 


14 . 936 


1 . 00 


40 .70 


A 


o 


ATOM 


487 


N 


ARG 


A 


61 


71 .302 


74 .710 


10 . 934 


1 . 00 


29.78 


A 


N 


ATOM 


488 


CA 


ARG 


A 


61 


70,489 


75 . 137 


12 . 074 


1 . 00 


32 .29 


A 


c 


ATOM 


489 


C 


ARG 


A 


61 


70.289 


73.992 


13.056 


1 . 00 


35 . 05 


A 


c 


ATOM 


490 


0 


ARG 


A 


61 


69.781 


72.931 


12. 695 


1.00 


33.45 


A 


0 


ATOM 


491 


CB 


ARG 


A 


61 


69.113 


75. 638 


11. 635 


1.00 


34.98 


A 


c 


ATOM 


492 


CG 


ARG 


A 


61 


69.146 


76,790 


10.663 


1.00 


33.55 


A 


c 


ATOM 


493 


CD 


ARG 


A 


61 


67.756 


77.209 


10.187 


1 . 00 


39. 45 


A 


c 


ATOM 


494 


NE 


ARG 


A 


61 


67 .802 


78 . 053 


8 . 991 


1 . 00 


43 . 50 


A 


N 


ATOM 


495 


CZ 


ARG 


A 


61 


66.737 


78 . 400 


8 . 267 


1 . 00 


43 . 32 


A 


C 


ATOM 


496 


NHl 


ARG 


A 


61 


65 .517 


77 . 969 


8 . 591 


1 .00 


43 . 64 


A 


N 


ATOM 


497 


NH2 


ARG 


A 


61 


66. 896 


79 . 173 


7 . 201 


1 . 00 


43 . 55 


A 


N 


ATOM 


498 


N 


ASP 


A 


62 


70 . 681 


74 .222 


14 . 302 


1 . 00 


32 . 81 


A 


N 


ATOM 


499 


CA 


ASP 


A 


62 


70 . 488 


73 .277 


15. 385 


1 .00 


34 . 32 


A 


C 


ATOM 


500 


C 


ASP 


A 


62 


69.019 


73 .222 


15. 812 


1.00 


35.83 


A 


C 


ATOM 


501 


0 


ASP 


A 


62 


68.368 


74.257 


15.972 


1.00 


37.43 


A 


O 


ATOM 


502 


CB 


ASP 


A 


62 


71.385 


73.703 


16.561 


1.00 


36.21 


A 


C 


ATOM 


503 


CG 


ASP 


A 


62 


71.724 


72.567 


17.509 


1.00 


37.73 


A 


C 


ATOM 


504 


ODl 


ASP 


A 


62 


71.078 


71.513 


17. 462 


1.00 


39.38 


A 


O 


ATOM 


505 


OD2 


ASP 


A 


62 


72.632 


72 . 654 


18 .366 


1.00 


38.06 


A 


O 


ATOM 


506 


N 


LEO 


A 


63 


68.504 


72 .009 


16. 000 


1.00 


32 .04 


A 


N 


ATOM 


507 


CA 


LEU 


A 


63 


67.151 


71.799 


16.496 


1.00 


33 .21 


A 


C 


ATOM 


508 


C 


LEU 


A 


63 


67 . 155 


71.580 


18 . 003 


1 .00 


31 . 37 


A 


C 


ATOM 


509 


O 


LEU 


A 


63 


66. 108 


71 .522 


18 . 621 


1 . 00 


33 . 62 


A 


0 


ATOM 


510 


CB 


LEU 


A 


63 


66.489 


70 . 603 


15 .793 


1 . 00 


32 .30 


A 


C 


ATOM 


511 


CG 


LEU 


A 


63 


65 .919 


70 . 957 


14 . 417 


1 . 00 


37 . 47 


A 


c 


ATOM 


512 


CDl 


LEU 


A 


63 


65 .566 


69. 688 


13 . 604 


1 • 00 


37 .52 


A 


c 


ATOM 


513 


CD2 


LEU 


A 


63 


64 . 696 


71 * 880 


14 .549 


1 .00 


37 .36 


A 


c 


ATOM 


514 


N 


ARG 


A 


64 


68 .345 


71.460 


18 . 580 


1.00 


34 . 84 


A 


N 


ATOM 


515 


CA 


ARG 


A 


64 


68 .514 


71 .279 


20 • 012 


1 . 00 


34 , 85 


A 


C 


ATOM 


516 


C 


ARG 


A 


64 


67 . 687 


70 . 109 


20 . 516 


1 . 00 


37 .89 


A 


C 


ATOM 


517 


0 


ARG 


A 


64 


66. 925 


70 .220 


21 .474 


1 . 00 


37 . 04 


A 


0 


ATOM 


518 


CB 


ARG 


A 


64 


68.180 


72 .583 


20 . 753 


1.00 


37.97 


A 


C 


ATOM 


519 


CG 


ARG 


A 


64 


68.865 


73 .821 


20 . 152 


1 . 00 


37 . 97 


A 


c 


ATOM 


520 


CD 


ARG 


A 


64 


68.726 


75 .089 


21 . 000 


1.00 


41.38 


A 


c 


ATOM 


521 


NE 


ARG 


A 


64 


69.447 


74 . 699 


22 .367 


0. 00 


47 , 96 


A 


N 


ATOM 


522 


CZ 


ARG 


A 


64 


69.722 


75. 629 


23 .275 


0.00 


49. 03 


A 


C 


ATOM 


523 


NHl 


ARG 


A 


64 


69.491 


76.907 


23 . 009 


0.00 


49.64 


A 


N 


ATOM 


524 


NH2 


ARG 


A 


64 


70.226 


75.281 


24 . 451 


0.00 


49.89 


A 


N 


ATOM 


525 


N 


LYS 


A 


65 


67 .844 


68 .973 


19 . 843 


1.00 


34.71 


A 


N 


ATOM 


526 


CA 


LYS 


A 


65 


67 .212 


67 ,732 


20 . 266 


1. 00 


35 .06 


A 


C 


ATC^ 




c 


LYS 


A 


65 


68 • 076 
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A 


f» 
u 


ATOM 


528 


0 


LYS 


A 


65 


68.655 


66.665 


18.695 


1.00 


31.69 


A 


0 


ATOM 


529 


CB 


LYS 


A 


65 


65.801 


67.642 


19.676 


1.00 


39.80 


A 


c 


ATOM 


530 


CG 


LYS 


A 


65 


64.967 


66.448 


20.138 


1.00 


43.42 


A 


c 


ATOM 


531 


CD 


LYS 


A 


65 


63.513 


66.564 


19.672 


1.00 


47.97 


A 


c 


ATOM 


532 


CE 


LYS 


A 


65 


62.653 


65.440 


20.263 


1.00 


50,01 


A 


c 


ATOM 


533 


NZ 


LYS 


A 


65 


61.233 


65.463 


19.797 


1.00 


51.34 


A 


N 


ATOM 


534 


N 


GLY 


A 


66 


68.190 


65.522 


20.565 


1.00 


31.22 


A 




ATOM 


535 


CA 


GLY 


A 


66 


68.910 


64.339 


20.149 


1.00 


31.55 


A 


C 


ATOM 


536 


C 


GLY 


A 


66 


67.996 


63.249 


19.616 


1.00 


32.06 


A 


C 
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ATOM 


537 


O 


GLY 


A 


66 


66.772 


63.399 


19.632 


1.00 


33.71 


A 


0 


ATOM 


538 


N 


VAL 


A 


67 


68.617 


62.153 


19.163 


1.00 


30,61 


A 


N 


ATOM 


539 


CA 


VAL 


A 


67 


67.927 


60.946 


18 . 675 


1.00 


32.04 


A 


C 


ATOM 


540 


C 


VAL 


A 


67 


68.756 


59.693 


18.978 


1.00 


32.39 


A 


c 


ATOM 


541 


0 


VAL 


A 


67 


69.982 


59.724 


18.870 


1.00 


29.49 


A 


0 


ATOM 


542 


CB 


VAL 


A 


67 


67.663 


61.024 


17.158 


1.00 


34.97 


A 


c 


ATOM 


543 


CGI 


VAL 


A 


67 


66.568 


61.988 


16.878 


1.00 


40.45 


A 


c 


ATOM 


544 


CG2 


VAL 


A 


67 


68.912 


61.440 


16.387 


1.00 


36.19 


A 


c 


ATOM 


545 


N 


TYR 


A 


68 


68.108 


58.602 


19.384 


1.00 


32.50 


A 


N 


ATOM 


546 


CA 


TYR 


A 


68 


68.817 


57.361 


19.709 


1.00 


36.46 


A 


c 


ATOM 


547 


C 


TYR 


A 


68 


68.113 


56.190 


19.062 


1.00 


34.88 


A 


c 


ATOM 


548 


0 


TYR 


A 


68 


66.962 


55.916 


19.383 


1.00 


36.97 


A 


0 


ATOM 


549 


CB 


TYR 


A 


68 


68.902 


57.148 


21.229 


1.00 


36.07 


A 


c 


ATOM 


550 


CG 


TYR 


A 


68 


69.801 


55.993 


21.670 


1.00 


41.81 


A 


c 


ATOM 


551 


GDI 


TYR 


A 


68 


69.460 


54.665 


21.395 


1.00 


43.38 


A 


c 


ATOM 


552 


CD2 


TYR 


A 


68 


70.981 


56.226 


22.379 


1.00 


44.20 


A 


c 


ATOM 


553 


CEl 


TYR 


A 


68 


70.274 


53.605 


21.798 


1.00 


43.39 


A 


c 


ATOM 


554 


CE2 


TYR 


A 


68 


71.805 


55,167 


22.789 


1.00 


44.55 


A 


c 


ATOM 


555 


CZ 


TYR 


A 


68 


71.444 


53,863 


22.492 


1.00 


45.41 


A 


c 


ATOM 


556 


OH 


TYR 


A 


68 


72.242 


52.807 


22 .897 


1.00 


47.48 


A 


o 


ATOM 


557 


N 


VAL 


A 


69 


68.826 


55.477 


18.196 


1.00 


33.48 


A 


N 


ATOM 


558 


CA 


VAL 


A 


69 


68.249 


54.404 


17.376 


1.00 


34.57 


A 


c 


ATOM 


559 


C 


VAL 


A 


69 


68.922 


53.080 


17.716 


1.00 


34.34 


A 


c 


ATOM 


560 


O 


VAL 


A 


69 


69.996 


52.793 


17.192 


1.00 


28.53 


A 


o 


ATOM 


561 


CB 


VAL 


A 


69 


68.440 


54.691 


15.866 


1.00 


35.13 


A 


c 


ATOM 


562 


CGI 


VAL 


A 


69 


67.944 


53.526 


15.002 


1.00 


38.45 


A 


c 


ATOM 


563 


CG2 


VAL 


A 


69 


67.754 


56.000 


15.484 


1.00 


36.74 


A 


c 


ATOM 


564 


N 


PRO 


A 


70 


68.319 


52.269 


18.588 


1.00 


39.88 


A 


N 


ATOM 


565 


CA 


PRO 


A 


70 


68.846 


50.922 


18.830 


1.00 


43.50 


A 


C 


ATOM 


566 


C 


PRO 


A 


70 


68.577 


50,028 


17.629 


1.00 


47.11 


A 


C 


ATOM 


567 


O 


PRO 


A 


70 


67,551 


50.175 


16.960 


1.00 


41.77 


A 


o 


ATOM 


568 


CB 


PRO 


A 


70 


68.097 


50.428 


20.077 


1.00 


44.42 


A 


c 


ATOM 


569 


CG 


PRO 


A 


70 


67.031 


51.423 


20.368 


1.00 


43.58 


A 


c 


ATOM 


570 


CD 


PRO 


A 


70 


67.125 


52.554 


19.397 


1.00 


42.11 


A 


c 


ATOM 


571 


N 


TYR 


A 


71 


69.527 


49.140 


17.367 


1.00 


51.98 


A 


N 


ATOM 


572 


CA 


TYR 


A 


71 


69.474 


48.179 


16.276 


1.00 


56.73 


A 


C 


ATOM 


573 


C 


TYR 


A 


71 


69.683 


46.796 


16.908 


1.00 


58.39 


A 


c 


ATOM 


574 


0 


TYR 


A 


71 


69.428 


46.618 


18.105 


1.00 


57.75 


A 


o 


ATOM 


575 


CB 


TYR 


A 


71 


70.558 


48.519 


15.229 


1,00 


57.66 


A 


c 


ATOM 


57 6 


CG 


TYR 


A 


71 


70.091 


49.405 


14.090 


1.00 


59.91 


A 


c 


ATOM 


577 


CDl 


TYR 


A 


71 


70.760 


50.591 


13.779 


1.00 


60.36 


A 


c 


ATOM 


578 


CD2 


TYR 


A 


71 


68.995 


49,049 


13,304 


1.00 


61.59 


A 


c 


ATOM 


579 


CEl 


TYR 


A 


71 


70.334 


51.408 


12.725 


1.00 


60.84 


A 


c 


ATOM 


580 


CE2 


TYR 


A 


71 


68.568 


49.857 


12.249 


1.00 


62.07 


A 


c 


ATOM 


581 


CZ 


TYR 


A 


71 


69.241 


51.035 


11.966 


1.00 


63.27 


A 


c 


ATOM 


582 


OH 


TYR 


A 


71 


68.818 


51.840 


10.924 


1.00 


64.04 


A 


o 


ATOM 


583 


N 


THR 


A 


72 


70.147 


45.832 


16.114 


1.00 


61.01 


A 


N 


ATOM 


584 


CA 


THR 


A 


72 


70.319 


44.444 


16.556 


1.00 


60.90 


A 


c 


ATOM 


585 


C 


THR 


A 


72 


71.093 


44.294 


17.877 


1.00 


59.74 


A 


c 


ATOM 


586 


O 


THR 


A 


72 


70.491 


44.060 


18.931 


1.00 


58.04 


A 


0 


ATOM 


587 


CB 


THR 


A 


72 


70.993 


43.609 


15.431 


1.00 


62.06 


A 


c 


ATOM 


588 


OGl 


THR 


A 


72 


72,170 


44.276 


14.951 


1.00 


61.28 


A 


o 


ATOM 


589 


CG2 


THR 


A 


72 


70.090 


43.514 


14,196 


1,00 


63.15 


A 


c 


ATOM 


590 


N 


GLN 


A 


73 


72.418 


44.402 


17.800 


1.00 


57.85 


A 


N 


ATOM 


591 


CA 


GLN 


A 


73 


73,287 


44.461 


18.971 


1.00 


57.41 


A 


c 


ATOM 


592 


C 


GLN 


A 


73 


74.155 


45.726 


18.850 


1.00 


54.83 


A 


c 


ATOM 


593 


O 


GLN 


A 


73 


75.303 


45.747 


19.299 


1.00 


57.07 


A 


0 


ATOM 


594 


CB 


GLN 


A 


73 


74.153 


43.194 


19.060 


1.00 


58.83 


A 


c 


ATOM 


595 


CG 


GLN 


A 


73 


73.865 


42.294 


20.273 


1.00 


60.65 


A 


c 


ATOM 


596 


CD 


GLN 


A 


73 


74.720 


42.630 


21.504 


1.00 


63.27 


A 


c 


ATOM 


597 


OEl 


GLN 


A 


73 


75.959 


42.582 


21.450 


1.00 


61.11 


A 


o 


ATOM 


598 


NE2 


GLN 


A 


73 


74.058 


42.943 


22.619 


1.00 


61.34 


A 


N 


ATOM 


599 


N 


GLY 


A 


74 


73.591 


46.763 


18.223 


1.00 


48.16 


A 


N 


ATOM 


600 


CA 


GLY 


A 


74 


74.262 


48.041 


18.020 


1.00 


42.89 


A 


c 


ATOM 


601 


C 


GLY 


A 


74 


73.290 


49.214 


18.016 


1.00 


38.35 


A 


C 


ATOM 


602 


O 


GLY 


A 


74 


72.224 


49.115 


18.625 


1,00 


39.09 


A 


o 


ATOM 


603 


N 


LYS 


A 


75 


73.656 


50.320 


17.360 


1.00 


32.71 


A 


N 


ATOM 


604 


CA 


LYS 


A 


75 


72.844 


51.554 


17.362 


1.00 


31.39 


A 


C 


ATOM 


605 


C 


LYS 


A 


75 


73.525 


52.762 


16.664 


1.00 


24.85 


A 


C 


ATOM 


606 


o 


LYS 


A 


75 


74.685 


52.698 


16.338 


1.00 


21.55 


A 


o 


ATOM 


607 


CB 


LYS 


A 


75 


72.483 


51.946 


18.800 


1.00 


34.61 


A 


C 


ATOM 


608 


CG 


LYS 


A 


75 


73.667 


52.144 


19.731 


1.00 


39.32 


A 


C 


ATOM 


609 


CD 


LYS 


A 


75 


74.545 


53.318 


19.299 


1.00 


39.96 


A 


C 


ATOM 


610 


CE 


LYS 


A 


75 


75.034 


54.144 


20.451 


1.00 


40.91 


A 


c 


ATC»I 


611 


NZ 


LYS 


A 


75 


74.297 


55.407 


20.464 


1,00 


45.34 


A 


N 


ATOM 


612 


H 


TRP 


A 


76 


72.782 


53.843 


16.434 


1.00 


22.44 


A 


N 


ATOM 


613 


CA 


TRP 


A 


76 


73.372 


55.173 


16.224 


1.00 


25.02 


A 


c 
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ATOM 


614 


C 


TRP 


A 


76 


72.594 


56.201 


17.012 


1.00 


23.16 


A 


C 


ATOM 


615 


0 


TRP 


A 


76 


71.429 


56.007 


17.353 


1.00 


21 .34 


A 


O 


ATOM 


616 


CB 


TRP 


A 


76 


73.512 


55.570 


14.732 


1.00 


24 .36 


A 


c 


ATOM 


617 


CG 


TRP 


A 


76 


72.243 


55.752 


13,957 


1 .00 


25 ,79 


A 


c 


ATOM 


618 


GDI 


TRP 


A 


76 


71.643 


54.833 


13.136 


1.00 


26,34 


A 


c 


ATOM 


619 


CD2 


TRP 


A 


76 


71.424 


56.932 


13.896 


1.00 


21,29 


A 


c 


ATOM 


620 


NEl 


TRP 


A 


76 


70.491 


55.364 


12.595 


1.00 


27 ,23 


A 


N 


ATOM 


621 


CB2 


TRP 


A 


76 


70.348 


56.656 


13.030 


1.00 


25.91 


A 


c 


ATOM 


622 


CE3 


TRP 


A 


76 


71,497 


58.202 


14.479 


1.00 


24.01 


A 


c 


ATOM 


623 


CZ2 


TRP 


A 


76 


69.349 


57.595 


12.752 


1,00 


26.87 


A 


c 


ATOM 


624 


CZ3 


TRP 


A 


76 


70.512 


59.124 


14.202 


1.00 


26.34 


A 


c 


ATOM 


625 


CH2 


TRP 


A 


76 


69.448 


58.818 


13.345 


1.00 


25.94 


A 


c 


ATOM 


626 


N 


GLO 


A 


77 


73 .291 


57.271 


17 . 354 


1 .00 


22 .20 


A 


N 


ATOM 


627 


CA 


GLU 


A 


77 


72 .753 


58.327 


18. 164 


1.00 


24 .84 


A 


C 


ATOM 


628 


C 


GLU 


A 


77 


73 .255 


59.632 


17. 575 


1.00 


22 .78 


A 


C 


ATOM 


629 


0 


GLU 


A 


77 


74 .386 


59.723 


17 . 089 


1 .00 


19. 61 


A 


O 


ATOM 


630 


CB 


GLU 


A 


77 


73.214 


58 .140 


19. 621 


1 ,00 


28 .88 


A 


c 


ATOM 


631 


CG 


GLU 


A 


77 


72.959 


59.331 


20.529 


1 .00 


35.35 


A 


C 


ATOM 


632 


CD 


GLU 


A 


77 


73.323 


59.057 


21.980 


0.50 


36.38 


A 


c 


ATOM 


633 


OEl 


GLU 


A 


77 


74.397 


58 .470 


22.222 


0.50 


42.18 


A 


O 


ATC»1 


634 


0E2 


GLU 


A 


77 


72.536 


59.431 


22.878 


0.50 


39.02 


A 


o 


ATC»I 


635 


N 


GLY 


A 


78 


72.418 


60.651 


17.573 


1.00 


24 .09 


A 


N 


ATOM 


636 


CA 


GLY 


A 


78 


72,811 


61.883 


16.933 


1.00 


25.68 


A 


c 


ATOM 


637 


C 


GLY 


A 


78 


72. 160 


63 .134 


17.453 


1.00 


25.43 


A 


c 


ATOM 


638 


0 


GLY 


A 


78 


71. 328 


63 .116 


18.350 


1.00 


27 . 93 


A 


o 


ATOM 


639 


N 


GLU 


A 


79 


72 . 579 


64 .234 


16.861 


1 .00 


23 .79 


A 


N 


ATOM 


640 


CA 


GLU 


A 


79 


72 . 078 


65 .542 


17.187 


1 ,00 


23.88 


A 


c 


ATOM 


641 


C 


GLU 


A 


79 


71 . 283 


65 .981 


15,979 


1 ,00 


22 . 32 


A 


c 


ATOM 


642 


0 


GLU 


A 


79 


71 .800 


65.979 


14 . 875 


1.00 


26. 62 


A 


0 


ATOM 


643 


CB 


GLU 


A 


79 


73 .255 


66.487 


17.457 


1.00 


23.99 


A 


c 


ATOM 


644 


CG 


GLU 


A 


79 


74 .109 


66.052 


18 . 641 


1.00 


29. 60 


A 


c 


ATOM 


645 


CD 


GLU 


A 


79 


75.420 


66.826 


18.790 


1.00 


33.85 


A 


c 


ATOM 


646 


OEl 


GLU 


A 


79 


76.205 


66.467 


19,685 


1.00 


34.63 


A 


0 


ATOM 


647 


0E2 


GLU 


A 


79 


75.670 


67.782 


18,030 


1.00 


37.21 


A 


0 


ATOM 


648 


N 


LEU 


A 


80 


70.017 


66.338 


16.180 


1.00 


22,69 


A 


N 


ATOM 


649 


CA 


LEU 


A 


80 


69.184 


66.809 


15.075 


1.00 


24.78 


A 


c 


ATOM 


650 


C 


LEU 


A 


80 


69.419 


68.267 


14.685 


1.00 


26.56 


A 


c 


ATOM 


651 


0 


LEU 


A 


80 


69.596 


69.139 


15.528 


1.00 


26.06 


A 


o 


ATOM 


652 


CB 


LEU 


A 


80 


67.704 


66.617 


15.403 


1.00 


26.07 


A 


c 


ATOM 


653 


CG 


LEU 


A 


80 


67.233 


65.168 


15.432 


1.00 


31.35 


A 


c 


ATOM 


654 


CDl 


LEU 


A 


80 


65.863 


65.082 


16.077 


1.00 


28.71 


A 


c 


ATOM 


655 


CD2 


LEU 


A 


80 


67.212 


64.609 


14.015 


1.00 


32.32 


A 


c 


ATOM 


656 


N 


GLY 


A 


81 


69.390 


68.525 


13.383 


1.00 


23.46 


A 


N 


ATOM 


657 


CA 


GLY 


A 


81 


69.500 


69.861 


12.822 


1.00 


22.46 


A 


c 


ATOM 


658 


C 


GLY 


A 


81 


68.854 


69.916 


11.448 


1.00 


26.63 


A 


C 


ATOM 


659 


0 


GLY 


A 


81 


68 .308 


68 .927 


11.002 


1.00 


22 .44 


A 


o 


ATOM 


660 


N 


THR 


A 


82 


68 .884 


71 .065 


10 .787 


1.00 


26.31 


A 


N 


ATOM 


661 


CA 


THR 


A 


82 


68 . 530 


71 .138 


9. 369. 


1.00 


28 .52 


A 


C 


ATOM 


662 


C 


THR 


A 


82 


69. 634 


71 .813 


8. 631 


1.00 


25.22 


A 


c 


ATOM 


663 


0 


THR 


A 


82 


70. 436 


72 .529 


9.225 


1 .00 


27 .82 


A 


o 


ATOM 


664 


CB 


THR 


A 


82 


67 . 190 


71.888 


9.127 


1.00 


29.47 


A 


c 


ATOM 


665 


OGl 


THR 


A 


82 


67.310 


73.253 


9.554 


1.00 


27.90 


A 


0 


ATOM 


666 


CG2 


THR 


A 


82 


66.069 


71.306 


9.972 


1.00 


30.70 


A 


C 


ATOM 


667 


N 


ASP 


A 


83 


69.704 


71.567 


7.326 


1.00 


24.75 


A 


N 


ATOM 


668 


CA 


ASP 


A 


83 


70. 679 


72.180 


6.447 


1.00 


22.11 


A 


C 


ATOM 


669 


C 


ASP 


A 


83 


70.241 


71.993 


5.009 


1.00 


24.09 


A 


C 


ATOM 


670 


O 


ASP 


A 


83 


69.261 


71,285 


4.741 


1.00 


26.17 


A 


0 


ATOM 


671 


CB 


ASP 


A 


83 


72 .075 


71 .559 


6.652 


1.00 


24.10 


A 


c 


ATOM 


672 


CG 


ASP 


A 


83 


73 .213 


72 .542 


6.376 


1.00 


26.19 


A 


c 


ATOM 


673 


ODl 


ASP 


A 


83 


73 .067 


73 .513 


5.580 


1.00 


27.95 


A 


o 


ATOM 


674 


OD2 


ASP 


A 


83 


74.328 


72 .409 


6.924 


1.00 


25.64 


A 


o 


ATOM 


675 


N 


LEU 


A 


84 


70. 973 


72 .591 


4 .081 


1.00 


26.89 


A 


N 


ATOM 


676 


CA 


LEU 


A 


84 


70. 641 


72.502 


2.658 


1.00 


27.35 


A 


c 


ATOM 


677 


C 


LEU 


A 


84 


71.224 


71.225 


2.078 


1.00 


28.51 


A 


C 


ATOM 


678 


O 


LEO 


A 


84 


72.398 


70 .936 


2.266 


1.00 


25.25 


A 


0 


ATOM 


679 


CB 


LEU 


A 


84 


71 .193 


73.717 


1.915 


1.00 


29.63 


A 


C 


ATOM 


680 


CG 


LEU 


A 


84 


70 . 550 


75.047 


2.345 


1 .00 


31.38 


A 


C 


ATOM 


681 


CDl 


LEU 


A 


84 


71 . 025 


76 . 228 


1 . 501 


1 . 00 


30 . 45 


A 


c 


ATOM 


682 


CD2 


LEU 


A 


84 


69.027 


74.949 


2.301 


1.00 


30,98 


A 


c 


ATOM 


683 


N 


VAL 


A 


85 


70.392 


70.465 


1.373 


1,00 


25.49 


A 




ATOM 


684 


CA 


VAL 


A 


85 


70.790 


69.203 


0,768 


1.00 


28.08 


A 


C 


ATOM 


685 


C 


VAL 


A 


85 


70.523 


69.177 


-0,737 


1.00 


27.65 


A 


C 


ATOM 


686 


O 


VAL 


A 


85 


69.511 


69.686 


-1.213 


1.00 


27 ,43 


A 


0 


ATCW 


687 


CB 


VAL 


A 


85 


70.063 


68.028 


1.439 


1.00 


27.18 


A 


C 


ATOM 


688 


CGI 


VAL 


A 


85 


70.564 


66.696 


0.875 


1.00 


27.97 


A 


C 


ATOM 


689 


CG2 


VAL 


A 


85 


70.273 


68.084 


2.950 


1.00 


29.93 


A 


C 


ATOM 


690 


N 


SER 


A 


86 


71.451 


68.587 


-1,472 


1.00 


28.67 


A 


N 
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ATOM 


691 


OA 


SBR A 


ATOM 


692 


C 


SER 


A 


ATOM 


693 


0 


SER 


A 


ATOM 


694 


CB 


SER 


A 


ATOM 


695 


OG 


SER 


A 


ATOM 


696 


N 


ILE 


A 


ATOM 


697 


CA 


ILE 


A 


ATOM 


698 


C 


ILE 


A 


ATOM 


699 


O 


ILE 


A 


AT^ 


700 


CB 


ILE 


A 


ATOM 


701 


CGI 


ILE 


A 


ATOM 


702 


CG2 


ILE 


A 


ATOM 


703 


GDI 


ILE 


A 


ATOM 


704 


N 


PRO 


A 


ATOM 


705 


CA 


PRO 


A 


ATOM 


706 


C 


PRO 


A 


ATOM 


707 


O 


PRO 


A 


ATOM 


708 


CB 


PRO 


A 


ATOM 


709 


CG 


PRO 


A 


ATOM 


710 


CD 


PRO 


A 


ATOM 


711 


N 


HIS 


A 


ATOM 


712 


CA 


HIS 


A 


ATOM 


713 


C 


HIS 


A 


ATOM 


714 


O 


HIS 


A 


ATOM 


715 


CB 


HIS 


A 


ATOM 


716 


CG 


HIS 


A 


ATOM 


717 


NDl 


HIS 


A 


ATOM 


718 


CD2 


HIS 


A 


ATOM 


719 


CEl 


HIS 


A 


ATOM 


720 


NE2 


HIS 


A 


ATOM 


721 


N 


GLY 


A 


ATOM 


722 


CA 


GLY 


A 


ATOM 


723 


C 


GLY 


A 


ATOM 


724 


0 


GLY 


A 


ATOM 


725 


N 


PRO 


A 


ATOM 


72 6 


CA 


PRO 


A 


ATOU 


727 


C 


PRO 


A 


ATOM 


728 


0 


PRO 


A 


ATOM 


729 


CB 


PRO 


A 


ATOM 


730 


CG 


PRO 


A 


ATOM 


731 


CD 


PRO 


A 


ATOM 


732 


N 


ASN 


A 


ATOM 


733 


CA 


ASN 


A 


ATOM 


734 


C 


ASN 


A 


ATOM 


735 


0 
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66 71.331 68.409 -2.913 

86 71.823 67.015 -3.293 

86 72.512 66.354 -2.509 

86 72.138 69.485 -3.642 

86 71.607 69.737 -4.930 

87 71.459 66.563 -4.494 
87 71.895 65.277 -5.006 
87 72.489 65.559 -6.384 
87 71.737 65.734 -7.354 
87 70.713 64.275 -5.094 
87 70.062 64.090 -3.729 
87 71.187 62.939 -5.631 

87 68.758 63.332 -3.747 

88 73.817 65.654 -6.453 
88 74.531 66.013 -7.689 
88 74.063 65.286 -8.956 
88 73.924 65.938 .-9.987 
88 75.971 65.632 -7.358 
88 76.067 65.895 -5.896 

88 74.762 65.455 -5.339 

89 73.857 63.972 -8.872 
89 73.332 63.162 -9.978 
89 71.871 62.815 -9.715 
89 71.449 61.661 -9.847 
89 74.173 61.907 -10.160 
89 75.632 62.184 -10.362 
89 76.120 62.833 -11.478 
89 76.708 61.905 -9.588 
89 77.435 62.933 -11.384 

89 77.817 62.376 -10.248 

90 71.120 63.846 -9.334 
90 69.696 63.769 -9.051 
90 69.005 64.963 -9.686 

90 69.524 65.524 -10.644 

91 67.861 65.382 -9.158 
91 67.175 66.565 -9.691 
91 67.987 67.835 -9.410 
91 68.764 67.852 -8.458 
91 65.837 66.579 -8.937 
91 66.049 65.738 -7.711 

91 67.164 64.807 -7.994 

92 67.809 68.863 -10.238 
92 68.496 70.152 -10.086 
92 67.841 71.060 -9.034 
92 67.368 72.156 -9.337 
92 68.546 70.860 -11.441 
92 69.438 72.079 -11.431 
92 70.604 72.003 -11.044 

92 68.895 73.217 -11.863 

93 67.830 70.592 -7.789 
93 67.205 71.310 -6.691 
93 68.043 71.217 -5.428 
93 68.907 70.353 -5.304 
93 65.794 70.772 -6.374 
93 64.868 70.960 -7.573 

93 65.848 69.310 -5.921 

94 67.772 72.139 -4.513 
94 58.320 72.119 -3.178 
94 67.170 72.293 -2.216 
94 66.283 73.119 -2.443 
94 69.327 73.252 -3.009 
94 70.459 73.016 -3.855 

94 69.910 73.256 -1.599 

95 67.162 71.515 -1.143 
95 66.110 71.652 -0.155 
95 66.660 71.686 1.261 
95 67.762 71.240 1.499 
95 65.071 70.544 -0.291 
95 64.479 70.568 -1.709 

95 65.663 69.183 0.025 

96 65.883 72.244 2.181 
96 66.212 72.215 3.597 
96 65.620 70.957 4.208 
96 64.402 70.809 4.302 
96 65.686 73.459 4.320 
96 65.976 73.474 5.835 
96 65.954 74.863 6.457 
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C 


ASP 


A 


223 


52 ,865 


42 , 194 


9.864 


1 . 00 


43 . 47 


A 


C 


ATOM 


1695 


0 


ASP 


A 


223 


52 . 008 


43 . 088 


9.777 


1.00 


45. 70 


A 


o 


ATOM 


1696 


CB 


ASP 


A 


223 


54 .339 


41 . 289 


11 . 693 


1.00 


50. 64 


A 


c 


ATOM 


1697 


CG 


ASP 


A 


223 


54 . 663 


42.585 


12 .348 


1 .00 


48. 82 


A 


c 


ATOM 


1698 


ODl 


ASP 


A 


223 


54 .041 


42.679 


13.392 


1.00 


56. 19 


A 


o 


ATGti 


1699 


0D2 


ASP 


A 


223 


55 . 483 


43.386 


11.871 


1.00 


54.79 


A 


0 


ATOM 


1700 


N 


LYS 


A 


224 


53.745 


41.920 


8.908 


1.00 


43.90 


A 


N 


ATOM 


1701 


CA 


LYS 


A 


224 


54 .128 


42.889 


7.872 


1.00 


43.50 


A 


c 


ATOM 


1702 


C 


LYS 


A 


224 


54 ,720 


42 .221 


6 . 651 


1.00 


38 . 65 


A 


C 


ATOM 


1703 


0 


LYS 


A 


224 


55 . 321 


41.177 


6.749 


1 ,00 


37.31 


A 


O 


ATOM 


1704 


CB 


LYS 


A 


224 


55 . 234 


43.791 


8 . 425 


1 . 00 


44 . 84 


A 


C 


ATOM 


1705 


CG 


LYS 


A 


224 


54 .814 


45 . 182 


8 . 824 


1 . 00 


49. 52 


A 


C 


ATOM 


1706 


CD 


LYS 


A 


224 


56 , 030 


46 . 109 


8 . 874 


1 . 00 


50 , 93 


A 


c 


ATOM 


1707 


CE 


LYS 


A 


224 


56 . 970 


45 . 783 


10 . 029 


1 . 00 


52. 43 


A 


c 


ATOM 


1708 


NZ 


LYS 


A 


224 


58 . 303 


45 . 329 


9 . 550 


1 . 00 


53.74 


A 


N 


ATOM 


1709 


N 


SER 


A 


225 


54 . 605 


42 . 855 


5.487 


1.00 


34. 67 


A 


N 


ATOM 


1710 


CA 


SER 


A 


225 


55 . 428 


42 .463 


4 .347 


1.00 


29.36 


A 


C 


ATOM 


1711 


C 


SER 


A 


225 


55 .923 


43.766 


3.727 


1.00 


23. 18 


A 


c 


ATOM 


1712 


O 


SER 


A 


225 


55 .109 


44. 637 


3.491 


1.00 


24.20 


A 


o 


ATOM 


1713 


CB 


SER 


A 


225 


54 . 634 


41 . 642 


3 . 319 


1 . 00 


33.54 


A 


c 


ATOM 


1714 


OG 


SER 


A 


225 


54 . 443 


40.303 


3 . 737 


1 .00 


31. 33 


A 


0 


ATOM 




N 


ILE 


A 


226 


57 , 235 


43 . 914 


3 . 514 


1 . 00 


24 , 65 


A 


N 


ATOM 


1 / Id 


CA 


ILE 


A 


226 


57 . 8 14 


45.209 


3 . 080 


1 , 00 


24 . 70 


A 


C 


ATOM 


17 17 


C 


ILE 


A 


226 


58 . 879 


45 . 132 


1. 973 


1. 00 


20, 50 


A 


C 


ATOM 


1718 


0 


ILE 


A 


226 


59 . 568 


44 . 138 


1 . 762 


1 . 00 


22. 18 


A 


o 


ATOM 


1719 


CB 


ILE 


A 


226 


58 . 378 


46.052 


4 .318 


1 .00 


26. 96 


A 


C 


ATOM 


1720 


CGI 


ILE 


A 


226 


59 • 715 


45 .495 


4 .804 


1 .00 


29.94 


A 


C 


ATOM 




CG2 


ILE 


A 


226 


57 . 350 


46. 153 


5.417 


1 . 00 


29. 80 


A 


C 


ATOM 


1722 


CDl 


ILE 


A 


226 


60.392 


46.347 


5.918 


1 .00 


30. 64 


A 


C 


ATOM 


1723 


N 


VAL 


A 


227 


59. 046 


46.229 


1.249 


1 .00 


21 .76 


A 


N 


ATOM 


1724 


CA 


VAL 


A 


227 


60 , 073 


46.314 


0 . 221 


1 .00 


20 ,70 


A 


C 


ATOM 


1725 


C 


VAL 


A 


227 


61 . 187 


47 . 177 


0 . 820 


1 . 00 


22 . 68 


A 


C 


ATOM 


1726 


0 


VAL 


A 


227 


60 . 949 


48 . 339 


1 . 114 


1 . 00 


25 . 66 


A 


o 


ATOM 


1727 


CB 


VAL 


A 


227 


59 . 486 


46 . 982 


-1 . 036 


1 .00 


22 . 09 


A 


c 


ATOM 


1728 


CGI 


VAL 


A 


227 


60 , 466 


46. 919 


-2,217 


1.00 


22.41 


A 


c 


ATOM 


1729 


CG2 


VAL 


A 


227 


58 , 172 


46.322 


-1.385 


1.00 


22.10 


A 


c 


ATOM 


1730 


N 


ASP 


A 


228 


62 . 377 


46.598 


1.025 


1.00 


25.94 


A 


N 


ATOM 


1731 


CA 


ASP 


A 


228 


63 . 488 


47 .250 


1.750 


1 .00 


26. 69 


A 


c 


ATOM 


1732 


C 


ASP 


A 


228 


64 .859 


47 . 181 


1.053 


1.00 


25. 68 


A 


c 




X r J J 


O 


ASP 


A 


228 


65 . 552 


46. 160 


1. 089 


1 . 00 


28 .14 


A 


o 




1 TiA 
X / J4 


CB 


ASP 


A 




63 . 610 


46 . 628 


3 . 151 


1 . 00 


30 . 80 


A 


c 




1 

J. / JD 


rrt 
v*c» 


ACD 
Aolr 


A 




64 . 507 


47 , 435 


4 . 073 


1 . 00 


36 . 12 


A 


c 


a TOM 


1 "7 "J C 


ODl 


ASP 


A 




65 ,240 


48 . 316 


3 . 575 


1 . 00 


32 . 68 


A 


0 


ATOM 


1737 


ono 


ASP 


A 




64 . 544 


47 . 261 


5 . 315 


1 . 00 


43 . 13 


A 


o 


HX UM 


J. / JO 


N 


SER 


A 


229 


65 . 273 


48 . 283 


0 . 448 


1 . 00 


20 . 40 


A 


N 


nlKJn 




A 
l^A 


CCD 


A 




oo . 5 J 4 


48 . 352 


-0 . 261 


1 . 00 


19 . 83 


A 


C 


ATOM 


1 T z rt 


C 


SER 


A 


229 


67 . 766 


48 . 269 


0 . 668 


1 . 00 


20. 00 


A 


C 


ATOM 


1741 


0 


SER 


A 


229 


68 . 848 


47 . 998 


0.202 


1 . 00 


20.19 


A 


0 


ATOM 


1742 


CB 


SER 


A 


229 


66. 608 


49. 641 


-1.084 


1. 00 


19.70 


A 


C 


ATOM 


1743 


OG 


SER 


A 


229 


66. 651 


50.793 


-0.239 


1.00 


20.04 


A 


0 


ATOM 


1744 


N 


GLY 


A 


230 


67 . 582 


48 .539 


1.955 


1.00 


24.02 


A 


N 


ATOM 


1745 


CA 


GLY 


A 


230 


68 , 666 


48 . 429 


2. 928 


1. 00 


30.23 


A 


C 


ATOM 


174 6 


C 


GLY 


A 


230 


69 . 016 


46 . 983 


3,270 


1. 00 


32.91 


A 


C 


ATOM 


1747 


O 


GLY 


A 


230 


70 , 179 


46 . 641 


3 .517 


1. 00 


37 .27 


A 


O 


ATOM 


1748 


N 


THR 


A 


231 


67 . 998 


46. 129 


3 .290 


1.00 


35.70 


A 


N 


ATOM 


T "7 ^ Q 


CA 


THR 


A 


231 


68 , 157 


44 . 736 


3 . 700 


1 . 00 


36. 19 


A 


C 


ATOM 


J. /ou 


C 


THR 


A 


231 


68 . 647 


43.943 


2 . 502 


1 . 00 


36, 17 


A 


C 


ATOM 


1 7 *;t 


O 


THR 


A 


231 


68 . 125 


44 . 098 


1 . 392 


1 , 00 


36.88 


A 


O 


ATOM 


X 


CB 


THR 


A 


231 


66 . 811 


44 , 203 


4 .216 


1 . 00 


36.06 


A 


C 


a TOM 


X / 9 J 


o»^i 


THR 


A 




66 . 371 


44 . 988 


5 .333 


1 . 00 


41 .33 


A 


0 


ATOM 


X / 94 




THR 


A 


£OX 


oo . 9 Jl 


42 • 806 


4 . 770 


1.00 


39. 41 


A 


C 


ATOM 




N 


THR 


A 




69. 676 


43 . 117 


2 .701 


1.00 


34 .96 


A 


N 


ATOM 


X f 9D 


CA 


THR 


A 


2 jZ 


70 . 213 


42 .299 


1 . 624 


1.00 


31.73 


A 


C 


ATOM 


X fO / 


/-« 

L> 


THR 


A 




69 • 299 


41 . Ill 


1 . 311 


1 . 00 


35.19 


A 


C 


ATUW 


1 T ci Q 
X fOo 


O 


THR 


A 


232 


69 , 149 


40,754 


0.150 


1.00 


34 .98 


A 


O 


ATOM 


1759 


CB 


THR 


TV 

A 




/ X . OXZ 


41 Tin 
HX , to/ 


1 O £0 


1.00 


36.41 


A 


c 


ATOM 


1760 


OGl 


THR 


A 


232 


72.502 


42.796 


2.338 


1.00 


33,05 


A 


O 


ATOM 


1761 


CG2 


THR 


A 


232 


72.274 


41.136 


0,735 


1.00 


36.22 


A 


c 


ATOM 


1762 


N 


ASN 


A 


233 


68,735 


40.502 


2.352 


1,00 


38.19 


A 


N 


ATOM 


1763 


CA 


ASN 


A 


233 


68.029 


39.230 


2.211 


1,00 


41.17 


A 


c 


ATC»4 


1764 


C 


ASN 


A 


233 


66.520 


39.365 


2.052 


1.00 


40,23 


A 


C 


ATOM 


1765 


o 


ASN 


A 


233 


65.922 


40.416 


2.327 


1.00 


39.76 


A 


0 


ATOM 


1766 


CB 


ASN 


A 


233 


68.307 


38.323 


3.420 


1.00 


43.40 


A 


C 


ATOM 


1767 


CG 


ASN 


A 


233 


69.767 


37.864 


3.503 


1.00 


44.47 


A 


C 


ATOM 


1768 


ODl 


ASN 


A 


233 


70.293 


37.678 


4.593 


1.00 


53.39 


A 


0 
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ATOM 


1769 


ND2 


ASN A 


233 


70 . 409 


37 . 667 


2.360 


1.00 


47 . 21 


A 


N 


ATOM 


1770 


N 


LEU A 


234 


65. 927 


38 .259 


1 . 613 


1 . 00 


39. 45 


A 


N 


ATOM 


1771 


CA 


LEU A 


234 


64 . 497 


38.025 


1 . 667 


1.00 


35 . 83 


A 


C 


ATOM 


1772 


C 


LEU A 


234 


64 . 178 


37.466 


3.035 


1.00 


31 ,21 


A 


C 


ATOM 


1773 


O 


LEU A 


234 


64 . 504 


36.342 


3.319 


1.00 


35. 53 


A 


O 


ATOM 


1774 


CB 


LEU A 


234 


64.119 


37.022 


0.562 


1.00 


41.31 


A 


C 


ATCU 


1775 


CG 


LED A 


234 


62 .727 


36.992 


-0.082 


1 .00 


42 . 04 


A 


C 


ATOM 


1776 


CDl 


LEU A 


234 


62 .447 


35.596 


-0 . 613 


1.00 


45 . 40 


A 


C 


ATOM 


1777 


C02 


LEtJ A 


234 


61 . 630 


37.429 


0.851 


1.00 


42.45 


A 


C 


ATOM 


1778 


N 


ARC A 


235 


63.564 


38.263 


3.906 


1.00 


39.51 


A 


N 


ATOM 


1779 


CA 


ARG A 


235 


63 .200 


37 .811 


5 .254 


1 . 00 


37 . 00 


A 


C 




1780 


C 


ARG A 


235 


61 . 737 


37 . 315 


5 . 290 


1 . 00 


38 . 00 


A 


C 


ATOM 


1781 


O 


ARG A 


235 


60 . 863 


37 . 918 


4 . 699 


1 .00 


31 . 41 


A 


0 


ATOM 


1782 


CB 


ARG A 


235 


63 . 434 


38 . 930 


6.278 


1.00 


40 . 50 


A 


c 


ATOM 


1783 


CG 


ARG A 


235 


64 .843 


39.557 


6.210 


1.00 


43 . 67 


A 


c 


ATOM 


1784 


CD 


ARG A 


235 


65 . 208 


40.478 


7 .378 


1.00 


46.74 


A 


c 


ATOM 


1785 


NE 


ARG A 


235 


65. 177 


39.774 


8 . 659 


1.00 


49.76 


A 


N 


ATOM 


1786 


CZ 


ARG A 


235 


64.729 


40.272 


9.823 


1 .00 


52 . 94 


A 


C 


ATOM 


1787 


NHl 


ARG A 


235 


64.272 


41.522 


9.918 


1.00 


51.18 


A 


N 


ATOM 


1788 


NH2 


ARG A 


235 


64.743 


39.503 


10.914 


1.00 


51.36 


A 


N 


ATOM 


1789 


N 


LEU A 


236 


61. 473 


36.226 


6.008 


1.00 


37.36 


A 


N 


ATOM 


1790 


CA 


LEU A 


236 


60 . 156 


35 . 571 


5 . 992 


1.00 


37.98 


A 


C 


ATOM 


1791 


C 


LEU A 


236 


59 . 691 


35 .254 


7 .408 


1 . 00 


39.27 


A 


C 


ATOM 


1792 


0 


LEU A 


236 


60 . 503 


34 . 847 


8 .230 


1 .00 


37 .29 


A 


O 


ATOM 


1793 


CB 


LEU A 


236 


60 . 238 


34 .271 


5 .210 


1 . 00 


37 . 66 


A 


C 


ATOM 


1794 


CG 


LEU A 


236 


60 . 689 


34 . 363 


3 .745 


1.00 


36.72 


A 


C 


ATOM 


1795 


CDl 


LEU A 


236 


60 . 713 


32 , 994 


3 . 135 


1 . 00 


34 . 34 


A 


C 


ATOM 


1796 


CD2 


LEU A 


236 


59 . 784 


35.269 


2 . 922 


1.00 


37 . 91 


A 


C 


ATOM 


1797 


N 


PRO A 


237 


58 .399 


35 .408 


7 . 719 


1.00 


37 . 12 


A 


N 


ATOM 


1798 


CA 


PRO A 


237 


57 • 939 


35.039 


9.061 


1.00 


35.72 


A 


C 


ATOM 


1799 


C 


PRO A 


237 


58 .324 


33 . 600 


9.304 


1.00 


36.04 


A 


C 


ATOM 


1800 


0 


PRO A 


237 


58 .364 


32.870 


8.326 


1.00 


31.43 


A 


0 


ATOM 


1801 


CB 


PRO A 


237 


56. 425 


35.222 


8.985 


1.00 


36.23 


A 


C 


ATOM 


1802 


CG 


PRO A 


237 


56.243 


36.263 


7 . 917 


1.00 


36.72 


A 


C 


ATOM 


1803 


CD 


PRO A 


237 


57 ,297 


35. 918 


6. 887 


1,00 


37.44 


A 


c 


ATOM 


1804 


N 


LYS A 


238 


58 . 647 


33.213 


10.538 


1.00 


34 .49 


A 


N 


ATOM 


1805 


CA 


LYS A 


238 


59 . 098 


31 . 839 


10. 805 


1,00 


34.56 


A 


C 


ATOM 


1806 


C 


LYS A 


238 


58 . 322 


30 .780 


10 . 051 


1.00 


32.78 


A 


c 


ATOM 


1807 


0 


LYS A 


238 


58 . 908 


29 . 860 


9. 452 


1 .00 


32 . 03 


A 


o 


ATOM 


1808 


CB 


LYS A 


238 


58 . 881 


31 . 433 


12.243 


1.00 


34 .24 


A 


c 


ATOM 


1809 


CG 


LYS A 


238 


59. 299 


32.363 


13.281 


1.00 


30.75 


A 


c 


ATOM 


1810 


CD 


LYS A 


238 


58 .868 


31.768 


14 .539 


1.00 


5.80 


A 


c 


ATOM 


1811 


CE 


LYS A 


238 


57. 493 


31 .846 


14 .975 


1.00 


27.32 


A 


c 


ATOM 


1812 


NZ 


LYS A 


238 


57.007 


31.123 


16.239 


1.00 


32.55 


A 


N 


ATOM 


1813 


N 


LYS A 


239 


56.998 


30.879 


10.142 


1.00 


28.36 


A 


N 


ATOM 


1814 


CA 


LYS A 


239 


56. 149 


29.747 


9.735 


1.00 


37 .49 


A 


c 


ATOM 


1815 


C 


LYS A 


239 


56. 292 


29 .508 


8 .233 


1.00 


35. 49 


A 


c 


ATOM 


1816 


0 


LYS A 


239 


56.310 


28 .360 


7.7 62 


1.00 


30.89 


A 


o 


ATOM 


1817 


CB 


LYS A 


239 


54 . 675 


29 . 990 


10 . 108 


1 .00 


39.29 


A 


c 


ATOM 


1818 


CG 


LYS A 


239 


54 . 110 


28 . 997 


11 . 108 


1.00 


46.03 


A 


c 


ATOM 


1819 


CD 


LYS A 


239 


52 . 700 


29. 406 


11.566 


1.00 


48. 69 


A 


c 


ATOM 


1820 


C£ 


LYS A 


239 


51 . 634 


28 . 404 


11 . 117 


1.00 


50.81 


A 


c 


TV TOM 


XO^ J. 


NZ 


LYS A 


239 


50 . 243 


28.846 


11.463 


1.00 


49. 91 


A 


N 


TV TOM 


1 POO 


N 


V7VL A 


240 


56. 411 


30. 614 


7.497 


1.00 


32 . 65 


A 


N 


ATOM 


lozo 


CA 


VAL A 


240 


56. 599 


30.569 


6.057 


1.00 


32 .95 


A 


c 


ATOM 




C 


VAL A 


240 


58.018 


30 . 172 


5. 684 


1.00 


28 .03 


A 


c 


ATOM 


1825 


0 


VAL A 


240 


58.201 


29 . 484 


4 . 704 


1.00 


26.73 


A 


0 


ATOM 


1826 


CB 


VAL A 


240 


5 6 . 323 


31 . 928 


5 .390 


1.00 


28 .86 


A 


c 


ATOM 


1827 


CGI 


VAL A 


240 


56 . 411 


31.801 


3 . 908 


1.00 


29.02 


A 


c 


ATOM 


1826 


CG2 


VAL A 


240 


54 , 963 


32 . 517 


5 .818 


1.00 


33 , 40 


A 


c 


ATOM 


1829 


N 


PHE A 


241 


59 . 019 


30 . 655 


6.430 


1 .00 


31.69 


A 


N 


ATOM 


1630 


CA 


PHE A 


241 


60 . 402 


30.267 


6. 159 


1.00 


34 .11 


A 


C 


ATOM 


1831 


C 


PHE A 


241 


60. 559 


28 .745 


6.338 


1.00 


32.98 


A 


C 


ATOM 


1832 


0 


PHE A 


241 


61 .100 


28 .029 


5.470 


1.00 


29.40 


A 


o 


ATOM 


1833 


CB 


PHE A 


241 


61 .360 


30.997 


7.095 


1.00 


35.69 


A 


C 


ATOM 


1834 


CG 


PHE A 


241 


62 .746 


30.459 


7.057 


1.00 


39.37 


A 


C 


ATOM 


1 Q C 


CDl 


PHE A 


241 


63 . 601 


30.785 


6. 012 


1.00 


42 .58 


A 


C 


ATOM 


1836 


CD2 


PHE A 


241 


63 . 197 


29 . 583 


8.047 


1 nn 




A 


C 


ATOM 


1837 


CEl 


PHE A 


241 


64.895 


30,272 


5.968 


1.00 


43,06 


A 


c 


ATOM 


1838 


CE2 


PHE A 


241 


64.479 


29.073 


8.007 


1.00 


38,87 


A 


c 


ATOM 


1839 


CZ 


PHE A 


241 


65.329 


29.419 


6.964 


1.00 


43,23 


A 


c 


ATOM 


1840 


N 


GLU A 


242 


60.075 


28.249 


7.468 


1.00 


32.78 


A 


N 


ATOM 


1841 


CA 


GLU A 


242 


60.011 


26.800 


7,696 


1.00 


36.60 


A 


C 


ATOM 


1842 


C 


GLU A 


242 


59.505 


26.025 


6.487 


1.00 


33.79 


A 


c 


ATCW 


1843 


0 


GLU A 


242 


60.160 


25,083 


6.053 


1.00 


35.48 


A 


o 


ATOM 


1844 


CB 


GLU A 


242 


59.123 


26.473 


8.899 


1.00 


37.43 


A 


c 


ATOM 


1845 


CG 


GLU A 


242 


59.630 


26.686 


10.217 


1.00 


43.87 


A 


c 
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ATOM 


1846 


CD 


GLU 


A 


242 


60.878 


25.635 


10.508 


1.00 


45.01 


A 


C 


ATOM 


1847 


OEl 


GLU 


A 


242 


61.759 


25.906 


11.358 


1.00 


45.07 


A 


o 


ATOM 


1848 


0E2 


GLU 


A 


242 


60.818 


24.545 


9.888 


1.00 


52 .08 


A 


o 


ATOM 


1849 


N 


AIiA 


A 


243 


58.358 


26.437 


5.942 


1.00 


34 . 61 


A 


N 


ATOM 


1850 


CA 


ALA 


A 


243 


57.752 


25.723 


4.804 


1.00 


36. 67 


A 


c 


ATOM 


1851 


C 


ALA 


A 


243 


58.531 


25.984 


3.523 


1.00 


35 . 54 


A 


c 


ATOM 


1852 


0 


ALA 


A 


243 


58.735 


25.093 


2.706 


1.00 


30. 69 


A 


o 


ATOM 


1853 


CB 


ALA 


A 


243 


56.307 


26.138 


4. 615 


1.00 


36.74 


A 


c 


ATOM 


1854 


N 


ALA 


A 


244 


58.961 


27.231 


3.375 


1.00 


36.29 


A 


N 


ATOM 


1855 


CA 


ALA 


A 


244 


59.717 


27.682 


2.224 


1.00 


36.28 


A 


C 


ATOM 


1856 


C 


ALA 


A 


244 


60.970 


26.841 


2.063 


1.00 


36.59 


A 


C 


ATOM 


1857 


0 


ALA 


A 


244 


61.133 


26.184 


1.058 


1.00 


35.35 


A 


0 


ATOM 


1858 


CB 


ALA 


A 


244 


60.073 


29.142 


2.383 


1.00 


34 . 54 


A 


C 


ATOM 


1859 


N 


VAL 


A 


245 


61.853 


26.884 


3.064 


1.00 


38. 90 


A 


N 


ATOM 


1860 


CA 


VAL 


A 


245 


63.002 


25.982 


3.143 


1.00 


42 .56 


A 


C 


ATOM 


1861 


C 


VAL 


A 


245 


62. 658 


24 .500 


2 .899 


1.00 


40.13 


A 


C 


ATOM 


1862 


0 


VAL 


A 


245 


63.341 


23.835 


2.114 


1.00 


40.38 


A 


o 


ATOM 


1863 


CB 


VAL 


A 


245 


63 .742 


26. 130 


4.515 


1 .00 


43. 37 


A 


c 


ATOM 


1864 


CGI 


VAL 


A 


245 


64 .651 


24 .918 


4.821 


1 .00 


47 .23 


A 


c 


ATOM 


1865 


CG2 


VAL 


A 


245 


64.541 


27 .420 


4.534 


1.00 


44.13 


A 


c 


ATOM 


1866 


N 


LYS 


A 


246 


61 . 627 


23.974 


3.556 


1.00 


39.54 


A 


N 


ATOM 


1867 


CA 


LYS 


A 


246 


61.270 


22.556 


3.352 


1.00 


44.44 


A 


C 


ATOM 


1868 


C 


LYS 


A 


246 


61.172 


22.215 


1.859 


1.00 


43.12 


A 


C 


ATOM 


1869 


0 


LYS 


A 


246 


61.745 


21 .233 


1 . 407 


1 . 00 


40 . 51 


A 


O 


ATOM 


1870 


CB 


LYS 


A 


246 


59. 965 


22 . 180 


4.068 


1.00 


46.80 


A 


C 


ATOM 


1871 


CG 


LYS 


A 


246 


59.575 


20.695 


3,924 


1,00 


50.38 


A 


C 


ATOM 


1872 


CD 


LYS 


A 


246 


58.263 


20.380 


4.653 


1.00 


52.29 


A 


C 


ATOM 


1873 


CE 


LYS 


A 


246 


57.530 


19.182 


4 .042 


1.00 


54.52 


A 


C 


ATOM 


1874 


NZ 


LYS 


A 


246 


58.452 


18.072 


3.656 


1.00 


53.80 


A 


N 


ATOM 


1875 


N 


SER 


A 


247 


60.473 


23.051 


1.097 


1.00 


44.68 


A 


N 


ATOM 


1876 


CA 


SBR 


A 


247 


60.282 


22.809 


-0.337 


1.00 


45.24 


A 


C 


ATOM 


1877 


C 


SER 


A 


247 


61.505 


23.062 


-1.226 


1.00 


43.18 


A 


c 


ATOM 


1878 


0 


SER 


A 


247 


61, 653 


22.423 


-2.258 


1.00 


38.59 


A 


0 


ATOM 


1879 


CB 


SER 


A 


247 


59.126 


23.654 


-0.869 


1.00 


45.97 


A 


c 


ATOM 


1880 


OG 


SER 


A 


247 


59.035 


23.478 


-2.266 


1.00 


42 . 87 


A 


o 


ATOM 


1881 


N 


ILE 


A 


248 


62 . 345 


24 .027 


-0.861 


1.00 


47 . 83 


A 


N 


ATOM 


1882 


CA 


ILE 


A 


248 


63 . 534 


24 .348 


-1.658 


1.00 


50 .81 


A 


C 


ATOM 


1883 


C 


ILE 


A 


248 


64 . 570 


23.241 


-1.475 


1.00 


52 . 35 


A 


C 


ATOM 


1884 


0 


ILE 


A 


248 


65 .200 


22.787 


-2.440 


1.00 


48 . 20 


A 


o 


ATOM 


1885 


CB 


ILE 


A 


248 


64 . 116 


25.716 


-1.260 


1 . 00 


51.35 


A 


c 


ATOM 


1886 


CGI 


ILE 


A 


248 


63. 101 


26.823 


-1.548 


1. 00 


50. 95 


A 


c 


ATOM 


1887 


CG2 


ILE 


A 


248 


65.428 


25.983 


-2.015 


1.00 


51.10 


A 


c 


ATOM 


1888 


CDl 


ILE 


A 


248 


63.447 


28.154 


-0.913 


1.00 


51.52 


A 


c 


ATOM 


1689 


N 


LYS 


A 


249 


64 .725 


22.814 


-0.227 


1.00 


53-72 


A 


N 


ATOM 


1890 


CA 


LYS 


A 


249 


65.451 


21.585 


0.108 


1.00 


59.31 


A 


C 


ATOM 


1891 


C 


LYS 


A 


249 


65.021 


20.377 


-0,745 


1.00 


61.00 


A 


C 


ATOM 


1892 


0 


LYS 


A 


249 


65.871 


19.634 


-1.233 


1.00 


58 .84 


A 


0 


ATOM 


1893 


CB 


LYS 


A 


249 


65.260 


21.257 


1.598 


1.00 


61.17 


A 


C 


ATOM 


1894 


CG 


LYS 


A 


249 


66.419 


20.522 


2.240 


1.00 


63.43 


A 


C 


ATOM 


1895 


CD 


LYS 


A 


249 


66.187 


20.317 


3.740 


1.00 


65.86 


A 


c 


ATOM 


1896 


CE 


LYS 


A 


249 


66.299 


21.620 


4 .530 


1.00 


67 . 19 


A 


c 


ATOM 


1897 


KZ 


LYS 


A 


249 


66.791 


21.417 


5.929 


1.00 


68.30 


A 


N 


ATOM 


1898 


N 


ALA 


A 


250 


63.711 


20.207 


-0.942 


1.00 


63.51 


A 


N 


ATOM 


1899 


CA 


ALA 


A 


250 


63.160 


19.006 


-1.589 


1.00 


66.01 


A 


C 


ATOM 


1900 


C 


ALA 


A 


250 


63.564 


18.839 


-3.059 


1.00 


67.43 


A 


C 


ATOM 


1901 


0 


ALA 


A 


250 


64.214 


17.861 


-3.408 


1 .00 


67 .59 


A 


O 


ATOM 


1902 


CB 


ALA 


A 


250 


61 . 635 


18.974 


-1.455 


1 .00 


65 . 47 


A 


c 


ATOM 


1903 


N 


ALA 


A 


251 


63 . 185 


19.783 


-3.917 


1,00 


69 . 93 


A 


N 


ATOM 


1904 


CA 


ALA 


A 


251 


63,539 


19. 694 


-5.342 


1 . 00 


70. 60 


A 


C 


ATOM 


1905 


C 


ALA 


A 


251 


64 . 985 


20 .123 


-5.633 


1 , 00 


70 . 50 


A 


C 


ATOM 


1906 


0 


ALA 


A 


251 


65 .364 


20 .268 


-6.794 


1 . 00 


69.29 


A 


o 


ATOM 


1907 


CB 


ALA 


A 


251 


62 .547 


20.488 


-6.212 


1,00 


70.96 


A 


C 


ATOM 


1908 


N 


SER 


A 


252 


65 .778 


20.338 


-4 .582 


1 .00 


70. 95 


A 


N 


ATOM 


1909 


CA 


SER 


A 


252 


67 .213 


20.562 


-4.718 


1.00 


72.23 


A 


C 


ATOM 


1910 


C 


SER 


A 


252 


68.016 


19.483 


-3.985 


1.00 


73.46 


A 


C 


ATOM 


1911 


0 


SER 


A 


252 


69.189 


19.680 


-3.661 


1.00 


71.37 


A 


o 


ATOM 


1912 


CB 


SER 


A 


252 


67.582 


21.951 


-4.189 


1.00 


72. 68 


A 


C 


ATOM 


1913 


OG 


SBR 


A 


252 


67 , 505 


21 . 999 


-2.775 


1 . 00 


73 , 18 


A 


o 


ATOM 


1914 


N 


SER 


A 


253 


67.389 


18.332 


-3.756 


1.00 


75.44 


A 


N 


ATOM 


1915 


CA 


SER 


A 


253 


68.011 


17.239 


-3.011 


1.00 


77.44 


A 


C 


ATOM 


1916 


C 


SER 


A 


253 


69.079 


16.491 


-3.819 


1.00 


79.67 


A 


C 


ATOM 


1917 


0 


SER 


A 


253 


69.783 


15.645 


-3.263 


1.00 


79.72 


A 


0 


ATOM 


1918 


CB 


SER 


A 


253 


66.944 


16.250 


-2.532 


1.00 


77.46 


A 


C 


ATOM 


1919 


OG 


SER 


A 


253 


66.037 


16.870 


-1.637 


1.00 


76.30 


A 


o 


ATOM 


1920 


N 


THR 


A 


254 


69.196 


16.799 


-5.116 


1.00 


81.88 


A 


N 


ATOM 


1921 


CA 


THR 


A 


254 


70.232 


16.215 


-5.983 


1.00 


83.80 


A 


C 


ATOM 


1922 


C 


THR 


A 


254 


71.624 


16.245 


-5.334 


1.00 


85.34 


A 


C 
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ATOM 


1923 


O 


THR 


A 


254 


72.423 


15.330 


-5.538 


1 . 00 


85 . 86 


A 


O 


ATOM 


1924 


CB 


THR 


A 


254 


70.270 


16.936 


-7.360 


1 . 00 


83 . 52 


A 


C 


ATOM 


1925 


OGl 


THR 


A 


254 


68.992 


16.851 


-7 .999 


1 . 00 


83 .21 


A 


0 


ATOM 


1926 


CG2 


THR 


A 


254 


71.205 


16.228 


-8.342 


1 . 00 


83 . 62 


A 


c 


ATOM 


1927 


N 


GLU 


A 


255 


71.909 


17.296 


-4 .565 


i-.oo 


86.72 


A 


N 


ATOM 


1928 


OA 


GLU 


A 


255 


73.121 


17.354 


-3.746 


1 .00 


87 ,75 


A 


C 


ATOM 


1929 


C 


GLU 


A 


255 


72.785 


17.714 


-2 . 302 


1 .00 


88 . 51 


A 


C 


ATOM 


1930 


0 


GLU 


A 


255 


72.017 


18.643 


-2.048 


1.00 


89.05 


A 


O 


ATOM 


1931 


CB 


GLU 


A 


255 


74.103 


18.379 


-4,307 


1.00 


87,94 


A 


C 


ATOM 


1932 


CG 


GLU 


A 


255 


74.553 


18.101 


-5.731 


1.00 


88 .21 


A 


C 


ATOM 


1933 


CD 


GLU 


A 


255 


75.403 


19.222 


-6-297 


1,00 


88.85 


A 


C 


ATOM 


1934 


OEl 


GLU 


A 


255 


76.162 


19.847 


-5.521 


1.00 


88 . 18 


A 


0 


ATOM 


1935 


0E2 


GLU 


A 


255 


75.308 


19.478 


-7.518 


1 .00 


89. 16 


A 


o 


ATOM 


1936 


N 


LYS 


A 


256 


73.367 


16.973 


-1.361 


1 .00 


89. 37 


A 


N 


ATOM 


1937 


CA 


LYS 


A 


256 


73 . 181 


17.233 


0.066 


1 .00 


89 . 70 


A 


C 


ATOM 


1938 


C 


LYS 


A 


256 


74.060 


18.406 


0.515 


1 . 00 


89 , 43 


A 


C 


ATOM 


1939 


O 


LYS 


A 


256 


75.211 


18.519 


0.090 


1 .00 


90 . 96 


A 


0 


ATOM 


1940 


CB 


LYS 


A 


256 


73 .524 


15.976 


0.878 


1 .00 


89 .79 


A 


C 


ATOM 


1941 


CG 


LYS 


A 


256 


73.344 


16.118 


2.390 


1 .00 


89 . 80 


A 


C 


ATOM 


1942 


CD 


LYS 


A 


256 


73.488 


14.778 


3.106 


1 .00 


89.76 


A 


C 


ATOM 


1943 


CE 


LYS 


A 


256 


74.916 


14.250 


3.037 


1.00 


89. 66 


A 


c 


ATOM 


1944 


NZ 


LYS 


A 


256 


75.135 


13.099 


3.955 


1.00 


89.59 


A 


N 


ATOM 


1945 


N 


PHE 


A 


257 


73.509 


19.278 


1,359 


1.00 


88.33 


A 


N 


ATOM 


1946 


CA 


PHE 


A 


257 


74 .277 


20.358 


1. 988 


1.00 


87, 80 


A 


C 


ATOM 


1947 


C 


PHE 


A 


257 


73.957 


20,434 


3 .486 


1.00 


85 . 90 


A 


C 


ATOM 


1948 


0 


PHE 


A 


257 


72 . 901 


19.963 


3 . 916 


1 . 00 


84 . 62 


A 


O 


ATOM 


1949 


CB 


PHE 


A 


257 


73.977 


21, 698 


1 .307 


1 .00 


88 .75 


A 


C 


ATOM 


1950 


CG 


PHE 


A 


257 


74 . 158 


21. 672 


-0 .188 


1 . 00 


90 . 18 


A 


C 


ATOM 


1951 


GDI 


PHE 


A 


257 


73. 128 


22.071 


-1 .035 


1 . 00 


90 . 73 


A 


C 


ATOM 


1952 


CD2 


PHE 


A 


257 


75. 358 


21.243 


-0.747 


1.00 


90. 65 


A 


C 


ATOM 


1953 


CEl 


PHE 


A 


257 


73.295 


22.043 


-2,417 


1.00 


91.50 


A 


c 


ATOM 


1954 


CE2 


PHE 


A 


257 


75.530 


21.208 


-2,125 


1.00 


91,35 


A 


c 


ATOM 


1955 


CZ 


PHE 


A 


257 


74,499 


21.611 


-2.961 


1.00 


91. 69 


A 


c 


ATOM 


1956 


N 


PRO 


A 


258 


74.857 


21.019 


4.282 


1.00 


84.41 


A 


N 


ATOM 


1957 


CA 


PRO 


A 


258 


74. 671 


21.079 


5.743 


1. 00 


83.94 


A 


C 


ATOM 


1958 


C 


PRO 


A 


258 


73.334 


21.697 


6.182 


1.00 


83,10 


A 


C 


ATOM 


1959 


0 


PRO 


A 


258 


72.764 


22.519 


5.459 


1.00 


82 . 64 


A 


O 


ATOM 


1960 


CB 


PRO 


A 


258 


75.840 


21.957 


6.218 


1.00 


83.97 


A 


C 


ATOM 


1961 


CG 


PRO 


A 


258 


76. 862 


21.878 


5.141 


1.00 


84 , 54 


A 


C 


ATOM 


1962 


CD 


PRO 


A 


258 


76. 116 


21. 664 


3.861 


1.00 


84 .44 


A 


c 


ATOM 


1963 




ASP 


A 


259 


72.852 


21.302 


7 .360 


1.00 


81,28 


A 


N 


ATOM 


1964 


CA 


ASP 


A 


259 


71, 608 


21.847 


7.916 


1.00 


79.36 


A 


c 


ATOM 


1965 


C 


ASP 


A 


259 


71,767 


23.327 


8.300 


1.00 


76.80 


A 


c 


ATOM 


1966 


0 


ASP 


A 


259 


70,804 


24.097 


8.228 


1.00 


75.77 


A 


O 


ATOM 


1967 


CB 


ASP 


A 


259 


71. 140 


21.025 


9.133 


1 .00 


80.43 


A 


c 


ATOM 


1968 


CG 


ASP 


A 


259 


69.749 


20.420 


8 .947 


1 .00 


81. 62 


A 


c 


ATOM 


1969 


ODl 


ASP 


A 


259 


69. 433 


19.944 


7 .832 


1 . 00 


81 . 91 


A 


o 


ATOM 


1970 


OD2 


ASP 


A 


259 


68 . 906 


20.364 


9.870 


1 . 00 


82 . 81 


A 


O 


ATOM 


1971 


N 


GLY 


A 


260 


72. 981 


23 .716 


8 . 694 


1 . 00 


72 . 15 


A 


N 


ATOM 


1972 


CA 


GLY 


A 


260 


73.280 


25.095 


9.051 


1.00 


68 .23 


A 


c 


ATOM 


1973 


C 


GLY 


A 


260 


73.394 


26.055 


7 .873 


1. 00 


65,01 


A 


c 


ATOM 


1974 


O 


GLY 


A 


260 


73,306 


27.266 


8.055 


1.00 


62, 06 


A 


o 


ATOM 


1975 


N 


PHE 


A 


261 


73.601 


25.529 


6.670 


1.00 


61.00 


A 


N 


ATOM 


1976 


CA 


PHE 


A 


261 


73.582 


26.350 


5.456 


1.00 


59.12 


A 


c 


ATOM 


1977 


C 


PHE 


A 


261 


72.247 


27.101 


5.311 


1.00 


57.30 


A 


c 


ATOM 


1978 


O 


PHE 


A 


261 


72.217 


28.296 


5.013 


1.00 


47.15 


A 


o 


ATOM 


1979 


CB 


PHE 


A 


261 


73.833 


25.475 


4 .222 


1.00 


59.54 


A 


c 


ATOM 


1980 


CG 


PHE 


A 


261 


73 . 504 


26.148 


2 .920 


1 . 00 


58 .56 


A 


c 


ATOM 


1981 


CDl 


PHE 


A 


261 


74 . 289 


27 .187 


2.447 


1 . 00 


57 .89 


A 


c 


ATOM 


1982 


CD2 


PHE 


A 


261 


72 . 415 


25 .741 


2.169 


1 . 00 


58 .88 


A 


c 


ATOM 


1983 


CEl 


PHE 


A 


261 


73 . 996 


27.807 


1 .259 


1 . 00 


56.77 


A 


c 


ATOM 


1984 


CE2 


PHE 


A 


261 


72 . 118 


26.359 


0. 972 


1 . 00 


58 .83 


A 


c 


ATOM 


1985 


CZ 


PHE 


A 


261 


72 .912 


27.393 


0.516 


1 . 00 


58 . 32 


A 


c 


ATOM 


1986 


N 


TRP 


A 


262 


71,151 


26.389 


5.557 


1 . 00 


56.79 


A 


N 


ATOM 


1987 


CA 


TRP 


A 


262 


69.813 


26.949 


5,396 


1 . 00 


58 .03 


A 


c 


ATOM 


1988 


C 


TRP 


A 


262 


69.475 


27.995 


6,467 


1.00 


58 . 02 


A 


c 


ATOM 


1989 


0 


TRP 


A 


262 


68.525 


28.760 


6.296 


1 , 00 


57 , 45 


A 


o 


ATOM 


1990 


CB 


TRP 


A 




68 . 759 


25 . 832 


5 . 406 


J. « uu 


0^ , 1.U 


A 


c 


ATOM 


1991 


CG 


TRP 


A 


262 


69.026 


24.721 


4.432 


1.00 


60.40 


A 


c 


ATOM 


1992 


CDl 


TRP 


A 


262 


69.372 


23.430 


4.730 


1.00 


61.96 


A 


c 


ATOM 


1993 


CD2 


TRP 


A 


262 


68.974 


24.800 


3.003 


1.00 


61.91 


A 


c 


ATOM 


1994 


NEl 


TRP 


A 


262 


69.535 


22.704 


3.574 


1.00 


60.97 


A 


N 


ATOM 


1995 


CE2 


TRP 


A 


262 


69.298 


23.520 


2.498 


1,00 


61.58 


A 


c 


ATOM 


1996 


CE3 


TRP 


A 


262 


68.688 


25.824 


2.092 


1.00 


62.89 


A 


c 
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85.006 27.012 -11.926 1.00 

86.850 28.443 -10.131 1.00 

83.408 28.289 -10.977 1.00 

82.523 28.301 -12.132 1.00 

81.761 27.008 -12.373 1.00 

81.383 26.720 -13.511 1.00 

81.509 26.250 -11.305 1.00 

80.861 24.940 -11.396 1.00 

79.570 24.921 -10.581 1.00 

79.158 23.880 -10.064 1.00 

81.812 23.828 -10.895 1.00 

82.135 24,043 -9.514 1.00 

83.162 23.879 -11.618 1.00 

78.929 26.079 -10.480 1.00 

77.719 26.213 -9.688 1.00 

76.535 25.616 -10.451 1.00 

76.311 25.959 -11.614 1.00 

77.450 27.690 -9.370 1.00 

78.649 28.313 -8.889 1.00 

76.472 27.827 -8.210 1.00 

75.790 24.718 -9.808 1.00 

74.591 24.138 -10.420 1.00 

73.386 25.087 -10.330 1.00 

72.759 25.213 -9.270 1.00 

74.363 22.864 -9.599 1.00 
74.945 23.160 -8.247 1.00 
76.033 24.178 -8.456 1.00 
73.084 25.757 -11.441 1.00 
71.945 26.671 -11.512 1.00 
70.645 25.914 -11.259 1.00 
69.868 26.253 -10.361 1.00 
71.852 27.327 -12.903 1.00 
72.863 28.426 -13.258 1.00 
73.299 28.743 -14.520 1.00 
73.518 29.358 -12.371 1.00 
74.186 29.792 -14.473 1.00 
74.340 30.190 -13.171 1.00 
73.503 29.573 -10.982 1.00 
75.129 31.211 -12.632 1.00 
74.291 30.587 -10.449 1.00 
75.092 31.392 -11.273 1.00 

70.450 24.864 -12.052 1.00 
69.149 24.232 -12.228 1.00 
68.678 23.332 -11.094 1.00 
67.516 22.952 -11.057 1.00 
69.145 23.456 -13.544 1.00 
69.124 24.372 -14.749 1.00 
68.090 24.528 -15.397 1.00 
70.261 25.002 -15.043 1.00 
69.567 23.002 -10.164 1.00 
69.188 22.191 -9.006 1.00 
68.277 22.978 -8.046 1.00 
67.509 22.382 -7.293 1.00 

70.451 21.673 -8.266 1.00 
71.311 20.801 -9.196 1.00 
70.069 20.890 -7.004 1.00 
70.601 19.582 -9.773 1.00 

68.364 24.310 -8.089 1.00 
67.577 25.184 -7.215 1.00 
66.327 25.723 -7.929 1.00 
66.404 26.160 -9.065 1.00 
68.440 26.352 -6.737 1.00 
69.641 25.934 -5.950 1,00 
70.860 25.735 -6.582 1.00 
69,554 25.741 -4.578 1.00 
71.975 25.351 -5.861 1.00 
70.663 25.351 -3.846 1.00 
71.880 25.158 -4.487 1.00 
65.183 25.713 -7.253 1.00 
63.933 26.153 -7.873 1.00 
63.830 27.670 -7.930 1.00 
64.540 28.377 -7.209 1.00 
62.875 25.625 -6.911 1.00 
63.552 25.707 -5.574 1.00 
64.985 25.312 -5.845 1.00 
62.942 28.158 -8.783 1.00 
62.594 29.567 -8.785 1.00 
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ATC»4 


2449 


N 


CYS 


A 


319 


82 . 065 


30.653 


-3.259 


1.00 


73.01 


A 


N 


ATOM 


2450 


CA 


CYS 


A 


319 


81.211 


30.428 


-2.094 


1.00 


71.05 


A 


c 


ATOM 


2451 


C 


CYS 


A 


319 


80.044 


31.414 


-2.062 


1.00 


68 . 67 


A 


c 


ATOM 




O 


CYS 


A 




du , Zoo 


32 . 611 


-2 . 258 


1 . 00 


OO . OO 


A 


0 


ATOM 


2453 


CB 


CYS 


A 


319 


82.026 


30.554 


-0.803 


1.00 


71.92 


A 


c 


ATOM 


2454 


SG 


CYS 


A 


319 


83.414 


29.394 


-0.686 


1.00 


74.09 


A 


S 


ATOM 


2455 


N 


TYR 


A 


320 


78.839 


30.899 


-1.832 


1.00 


66.44 


A 


N 


ATOM 


2456 


CA 


TYR 


A 


320 


77.645 


31.728 


-1.691 


1.00 


64.22 


A 


c 


ATOM 


2457 


C 


TYR 


A 


320 


77.013 


31.498 


-0.320 


1.00 


62.66 


A 


c 


ATOM 


2458 


O 


TYR 


A 


320 


77.332 


30.523 


0.365 


1.00 


61.41 


A 


0 


ATOM 


2459 


CB 


TYR 


A 


320 


76.611 


31.384 


-2.764 


1,00 


64.24 


A 


c 


ATOM 


2460 


CG 


TYR 


A 


320 


77.108 


31.323 


-4.198 


1.00 


64.74 


A 


c 


ATOM 


2461 


CDl 


TYR 


A 


320 


77.957 


30.302 


-4.632 


1.00 


64.73 


A 


c 
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ATOM 


2462 


CD2 


TYR 


A 


320 


76.682 


32.260 


-5.139 


1.00 


64.78 


A 


C 


ATOM 


2463 


CEl 


TYR 


A 


320 


78.390 


30.239 


-5.961 


1.00 


64.57 


A 


c 


ATOM 


2464 


CE2 


TYR 


A 


320 


77.106 


32.202 


-6.465 


1.00 


64.30 


A 


c 


ATOM 


2465 


CZ 


TYR 


A 


320 


77.957 


31.193 


-6.871 


1.00 


64.35 


A 


c 


ATOM 


2466 


OH 


TYR 


A 


320 


78.374 


31.146 


-8.185 


1.00 


62.89 


A 


O 


ATOM 


2467 


N 


LYS 


A 


321 


76.111 


32.392 


0.076 


1.00 


59.11 


A 


N 


ATOM 


2468 


CA 


LYS 


A 


321 


75.320 


32.189 


1.286 


1.00 


58.36 


A 


c 


ATOM 


2469 


C 


LYS 


A 


321 


73.816 


32.265 


0.996 


1.00 


55.24 


A 


c 


ATOM 


2470 


O 


LYS 


A 


321 


73.353 


33.065 


0.181 


1.00 


47.87 


A 


o 


ATOM 


2471 


CB 


LYS 


A 


321 


75.747 


33.160 


2.398 


1.00 


59.88 


A 


c 


ATOM 


2472 


CG 


LYS 


A 


321 


74.853 


34.371 


2.629 


1.00 


62.72 


A 


c 


ATOM 


2473 


CD 


LYS 


A 


321 


75.277 


35.120 


3.888 


1.00 


64.37 


A 


c 


ATOM 


2474 


CE 


LYS 


A 


321 


74.699 


34.483 


5.145 


1.00 


64.91 


A 


c 


ATOM 


2475 


NZ 


LYS 


A 


321 


74.999 


35,289 


6.363 


1.00 


66.23 


A 


N 


ATOM 


2476 


N 


PHE 


A 


322 


73,062 


31.401 


1.664 


1.00 


51.88 


A 


N 


ATOM 


2477 


CA 


PHE 


A 


322 


71.619 


31.397 


1,531 


1.00 


49.73 


A 


C 


ATOM 


2478 


C 


PHE 


A 


322 


71,087 


32.737 


2.019 


1.00 


49.06 


A 


C 


ATOM 


2479 


0 


PHE 


A 


322 


71.346 


33.148 


3.154 


1.00 


45.51 


A 


O 


ATOM 


2480 


CB 


PHE 


A 


322 


70.999 


30.235 


2.321 


1.00 


49.10 


A 


C 


ATC»4 


2481 


CG 


PHE 


A 


322 


69.573 


29.925 


1.935 


1.00 


44.63 


A 


C 


ATOM 


2482 


CDl 


PHE 


A 


322 


68.563 


29.921 


2.894 


1.00 


45.73 


A 


C 


ATOM 


2483 


CD2 


PHE 


A 


322 


69.248 


29.629 


0.623 


1.00 


39.25 


A 


C 


ATOM 


2484 


CEl 


PHE 


A 


322 


67.252 


29.634 


2.539 


1.00 


44.70 


A 


C 


ATOM 


2485 


CE2 


PHE 


A 


322 


67.950 


29.351 


0.261 


1.00 


41.50 


A 


c 


ATOM 


2486 


CZ 


PHE 


A 


322 


66.947 


29.353 


1.218 


1.00 


42.07 


A 


c 


ATOM 


2487 


N 


ALA 


A 


323 


70.339 


33.399 


1.142 


1.00 


47.94 


A 


N 


ATOM 


2488 


CA 


ALA 


A 


323 


69.901 


34.771 


1.336 


1.00 


46.61 


A 


c 


ATOM 


2489 


C 


ALA 


A 


323 


68.426 


34.839 


1.712 


1.00 


44.07 


A 


c 


ATOM 


2490 


O 


ALA 


A 


323 


67.809 


35.887 


1.548 


1.00 


33.93 


A 


O 


ATOM 


2491 


CB 


ALA 


A 


323 


70.133 


35.564 


0.054 


1.00 


49.38 


A 


c 


ATOM 


2492 


N 


ILB 


A 


324 


67.853 


33.725 


2.169 


1.00 


37.73 


A 


N 


ATOM 


2493 


CA 


ILB 


A 


324 


66.520 


33.746 


2.739 


1.00 


38.21 


A 


C 


ATOM 


2494 


C 


ILB 


A 


324 


66.643 


33.442 


4.214 


1.00 


34.73 


A 


C 


ATOM 


2495 


O 


ILB 


A 


324 


67.442 


32.611 


4.619 


1.00 


36.43 


A 


0 


ATOM 


2496 


CB 


ILE 


A 


324 


65.577 


32,736 


2.038 


1.00 


38.89 


A 


c 


ATOM 


2497 


CGI 


ILE 


A 


324 


65.714 


32.862 


0.518 


1.00 


38.34 


A 


c 


ATOM 


2498 


CG2 


ILE 


A 


324 


64.126 


32.960 


2.495 


1.00 


40.76 


A 


c 


ATOM 


2499 


CDl 


ILE 


A 


324 


64.684 


32.110 


-0.277 


1,00 


41.65 


A 


c 


ATOM 


2500 


N 


SER 


A 


325 


65.840 


34 .112 


5.020 


1.00 


32.69 


A 


N 


ATOM 


2501 


CA 


SBR 


A 


325 


66.013 


34 .031 


6.4 60 


1.00 


38.71 


A 


c 


ATOM 


2502 


C 


SER 


A 


325 


64.722 


34.407 


7.139 


1.00 


39.64 


A 


c 


ATOM 


2503 


O 


SER 


A 


325 


63.792 


34.883 


6.509 


1.00 


40.29 


A 


0 


ATOM 


2504 


CB 


SER 


A 


325 


67.150 


34.953 


6.925 


1.00 


40.03 


A 


c 


ATOM 


2505 


OG 


SER 


A 


325 


66.788 


36.327 


6.838 


1.00 


38.81 


A 


0 


ATOM 


2506 


N 


GLN 


A 


326 


64.677 


34.221 


8.440 


1.00 


38.99 


A 


N 


ATOM 


2507 


CA 


GLN 


A 


326 


63.414 


34.244 


9.134 


1.00 


39.09 


A 


c 


ATOM 


2508 


C 


GLN 


A 


326 


63.294 


35.561 


9.885 


1,00 


36.76 


A 


c 


ATOM 


2509 


O 


GLN 


A 


326 


64.259 


36.297 


10.004 


1.00 


39.83 


A 


o 


ATOM 


2510 


CB 


GLN 


A 


326 


63.287 


33.011 


10.037 


1.00 


41.88 


A 


c 


ATOM 


2511 


CG 


GLN 


A 


326 


64.481 


32.024 


9.957 


1,00 


45.10 


A 


c 


ATOM 


2512 


CD 


GLN 


A 


326 


64.374 


30.817 


10.892 


1,00 


50.52 


A 


c 


ATOM 


2513 


DEI 


GLN 


A 


326 


65.393 


30.286 


11.322 


1.00 


51.97 


A 


O 


ATOM 


2514 


NE2 


GLN 


A 


326 


63.151 


30.388 


11.202 


1.00 


51.73 


A 


N 


ATOM 


2515 


N 


SER 


A 


327 


62.108 


35.878 


10.374 


1.00 


30.92 


A 


N 


ATOM 


2516 


CA 


SER 


A 


327 


61.904 


37.188 


10.988 


1.00 


35-17 


A 


c 


ATOM 


2517 


C 


SER 


A 


327 


60.788 


37.140 


11.993 


1.00 


36.09 


A 


c 


ATOM 


2518 


O 


SER 


A 


327 


59.978 


36.208 


12.018 


1.00 


38 .75 


A 


0 


ATOM 


2519 


CB 


SER 


A 


327 


61.578 


38.259 


9.921 


1.00 


35.42 


A 


c 


ATOM 


2520 


OG 


SER 


A 


327 


60.882 


39.380 


10.482 


1.00 


33.54 


A 


0 


ATOM 


2521 


N 


SER 


A 


328 


60.723 


38.174 


12.808 


1.00 


31.04 


A 


N 


ATOM 


2522 


CA 


SER 


A 


328 


59.597 


38.328 


13.697 


1,00 


37.93 


A 


c 


ATOM 


2523 


C 


SER 


A 


328 


58.868 


39,654 


13.520 


1.00 


39.07 


A 


c 


ATOM 


2524 


O 


SER 


A 


328 


57.960 


39.967 


14.296 


1.00 


43.35 


A 


0 


ATOM 


2525 


CB 


SBR 


A 


328 


60.086 


38.167 


15.123 


1.00 


41.74 


A 


C 


ATOM 


2526 


OG 


SER 


A 


328 


60.967 


39.227 


15.485 


1.00 


45.41 


A 


0 


ATOM 


2527 


N 


THR 


A 


329 


59.257 


40.409 


12.492 


1.00 


37.31 


A 


N 


ATOM 


2528 


CA 


THR 


A 


329 


58.675 


41,715 


12.186 


1.00 


40.83 


A 


C 


ATOM 


2529 


C 


THR 


A 


329 


58.020 


41.703 


10.797 


1.00 


39.53 


A 


c 


ATOM 


2530 


O 


THR 


A 


329 


57.814 


42.771 


10.218 


1.00 


37.06 


A 


0 


ATOM 


2531 


CB 


THR 


A 


329 


59.770 


42.814 


12.193 


1.00 


42.95 


A 


c 


ATOM 


2532 


OGl 


THR 


A 


329 


60.831 


42.455 


11.287 


1.00 


41,26 


A 


0 


ATOM 


2533 


CG2 


THR 


A 


329 


60.441 


42.933 


13.558 


1,00 


44.90 


A 


c 


ATOM 


2534 


N 


GLY 


A 


330 


57.724 


40.510 


10,270 


1.00 


37.04 


A 


N 


ATOM 


2535 


CA 


GLY 


A 


330 


57.032 


40.358 


8,988 


1,00 


35,68 


A 


c 


ATOM 


2536 


C 


GLY 


A 


330 


57.929 


39.930 


7,836 


1.00 


34.85 


A 


c 


ATOM 


2537 


O 


GLY 


A 


330 


59.067 


39.526 


8.047 


1.00 


32.50 


A 


0 


ATOM 


2538 


N 


THR 


A 


331 


57.398 


39.974 


6.605 


1.00 


33.39 


A 


N 
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ATOM 


2539 


CA 


THR 


A 


331 


58.207 


39.709 


5.419 


1.00 


28.89 


A 


C 


ATOM 


2540 


C 


THR 


A 


331 


58.979 


40.971 


4.998 


1.00 


27.79 


A 


C 


ATOM 


2541 


0 


THR 


A 


331 


58.496 


42.086 


5.175 


1.00 


33.40 


A 


o 


ATOM 


2542 


CB 


THR 


A 


331 


57.320 


39.267 


4 .249 


1.00 


30.99 


A 


c 


ATOM 


2543 


OGl 


THR 


A 


331 


56.695 


38.020 


4.561 


1.00 


30.88 


A 


o 


ATOM 


2544 


CG2 


THR 


A 


331 


58.157 


38.989 


2.983 


1.00 


33.73 


A 


c 


ATOM 


2545 


N 


VAL 


A 


332 


60.177 


40.764 


4.470 


1.00 


22.23 


A 


N 


ATOM 


2546 


CA 


VAL 


A 


332 


61.065 


41.847 


3,989 


1.00 


27.82 


A 


C 


ATOM 


2547 


C 


VAL 


A 


332 


61.630 


41.466 


2.632 


1.00 


24.10 


A 


C 


ATOM 


2548 


O 


VAL 


A 


332 


62.394 


40.540 


2.501 


1.00 


26.31 


A 


O 


ATOM 


2549 


CB 


VAL 


A 


332 


62.274 


42.139 


4 .935 


1.00 


28.52 


A 


C 


ATOM 


2550 


CGI 


VAL 


A 


332 


63.138 


43.293 


4.367 


1.00 


32.83 


A 


C 


ATOM 


2551 


CG2 


VAL 


A 


332 


61.801 


42.468 


6.329 


1.00 


30.87 


A 


c 


ATOM 


2552 


N 


MET 


A 


333 


61.242 


42.199 


1.591 


1.00 


29.37 


A 


N 


ATOM 


2553 


CA 


MET 


A 


333 


61.760 


41.968 


0.251 


1.00 


23,35 


A 


C 


ATOM 


2554 


C 


MET 


A 


333 


63.012 


42.813 


-0.015 


1.00 


26.09 


A 


C 


ATOM 


2555 


0 


MET 


A 


333 


62.920 


43.935 


-0.512 


1.00 


22.66 


A 


o 


ATOM 


2556 


CB 


MET 


A 


333 


60.687 


42 .296 


-0.804 


1.00 


26.84 


A 


C 


ATOM 


2557 


CG 


MET 


A 


333 


59.550 


41.295 


-0.855 


1.00 


28.90 


A 


C 


ATOM 


2558 


SD 


MET 


A 


333 


58.086 


41.883 


-1.807 


1.00 


33.93 


A 


S 


ATC^ 


2559 


CE 


MET 


A 


333 


58.640 


41.701 


-3.325 


1.00 


30.83 


A 


c 


ATOM 


2560 


N 


GLY 


A 


334 


64.179 


42.265 


0.294 


1.00 


26.44 


A 


N 


ATOM 


2561 


CA 


GLY 


A 


334 


65.428 


43.015 


0.190 


1 .00 


28.29 


A 


C 


ATOM 


2562 


C 


GLY 


A 


334 


66.044 


43.002 


-1.185 


1.00 


28. 85 


A 


C 


ATOM 


2563 


0 


GLY 


A 


334 


65 .370 


42.791 


-2 . 185 


1.00 


28.04 


A 


o 


ATOM 


2564 


N 


7VLA 


A 


335 


67 .350 


43 .220 


-1 .243 


1. 00 


29. 46 


A 


N 


ATOM 


2565 


CA 


ALA 


A 


335 


68.097 


43.214 


-2.489 


1.00 


28.30 


A 


C 


ATOM 


2566 


C 


ALA 


A 


335 


67.939 


41.952 


-3.330 


1.00 


31.11 


A 


C 


ATOM 


2567 


0 


ALA 


A 


335 


68,001 


42.021 


-4.563 


1,00 


29.82 


A 


0 


ATOM 


2568 


CB 


ALA 


A 


335 


69.578 


43.470 


-2.206 


1.00 


34.20 


A 


C 


ATOM 


2569 


K 


VAL 


A 


336 


67.738 


40.805 


-2.671 


1.00 


31.66 


A 


N 


ATOM 


2570 


CA 


VAL 


A 


336 


67.506 


39.532 


-3.349 


1,00 


35.42 


A 


C 


ATCafl 


2571 


C 


VAL 


A 


336 


66.412 


39.733 


-4.393 


1.00 


32.46 


A 


C 


ATOM 


2572 


o 


VAL 


A 


336 


66.626 


39.464 


-5.574 


1.00 


36.08 


A 


O 


ATOM 


2573 


CB 


VAL 


A 


336 


67.096 


38.405 


-2.341 


1 .00 


38.14 


A 


C 


ATOM 


2574 


CGI 


VAL 


A 


336 


66.466 


37 .196 


-3 .057 


1.00 


41.63 


A 


C 


ATOM 


2575 


CG2 


VAL 


A 


336 


68 .294 


37.960 


-1.518 


1 .00 


42.38 


A 


c 


ATOM 


2576 


N 


ILE 


A 


337 


65 ,271 


40.248 


-3,944 


1 ,00 


32.57 


A 


N 


ATOM 


2577 


CA 


ILE 


A 


337 


64.130 


40.507 


-4,832 


1.00 


31.81 


A 


c 


ATOM 


2578 


C 


ILE 


A 


337 


64.389 


41.687 


-5 .760 


1.00 


29.91 


A 


c 


ATOM 


2579 


0 


ILE 


A 


337 


64.231 


41.592 


-6.969 


1.00 


27.22 


A 


0 


ATOM 


2580 


CB 


ILE 


A 


337 


62.835 


40.731 


-4.005 


1.00 


34.05 


A 


c 


ATOM 


2581 


CGI 


ILE 


A 


337 


62.466 


39.472 


-3.216 


1.00 


37.53 


A 


c 


ATOM 


2582 


CG2 


ILE 


A 


337 


61.668 


41.174 


-4.903 


1.00 


34.31 


A 


c 


ATOM 


2583 


CDl 


ILE 


A 


337 


61.814 


38.383 


-4,043 


1.00 


39.14 


A 


c 


ATOM 


2584 


N 


MET 


A 


338 


64 .813 


42 . 816 


-5.202 


1.00 


28.42 


A 


N 


ATOM 


2585 


CA 


MET 


A 


338 


64 ,909 


44 .038 


-5. 998 


1.00 


27 .14 


A 


c 


ATOM 


2586 


C 


MET 


A 


338 


65.914 


43.969 


-7.154 


1.00 


28.91 


A 


C 


ATOM 


2587 


0 


MET 


A 


338 


65.753 


44.650 


-8.166 


1.00 


28.30 


A 


O 


ATOM 


2588 


CB 


MET 


A 


338 


65.195 


45 .214 


-5.067 


1 .00 


25.93 


A 


c 


ATOM 


2589 


CG 


MET 


A 


338 


64 .083 


45.457 


-4.082 


1.00 


27.03 


A 


c 


ATOM 


2590 


SD 


MET 


A 


338 


64.367 


46.907 


-3.076 


1.00 


25.07 


A 


s 


ATOM 


2591 


CE 


MET 


A 


338 


64.174 


48.235 


-4.312 


1.00 


20.81 


A 


c 


ATOM 


2592 


N 


GLU 


A 


339 


66.954 


43.142 


-7.040 


1.00 


29.81 


A 


N 


ATOM 


2593 


CA 


GLU 


A 


339 


67.909 


43.018 


-8.142 


1.00 


32.30 


A 


c 


ATOM 


2594 


C 


GLU 


A 


339 


67.318 


42.371 


-9.403 


1.00 


30.06 


A 


c 


ATOM 


2595 


0 


GLU 


A 


339 


67 .874 


42.502 


-10.488 


1.00 


33. 61 


A 


O 


ATOM 


2596 


CB 


GLU 


A 


339 


69.174 


42 .269 


-7 .704 


1 .00 


33.90 


A 


c 


ATOM 


2597 


CG 


GLU 


A 


339 


70 . 197 


43 .177 


-7.027 


1.00 


38.40 


A 


c 


ATOM 


2598 


CD 


GLU 


A 


339 


71. 139 


42 . 424 


-6,107 


1 .00 


42.07 


A 


c 
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A 


c 


ATOM 


2743 


M 


SER 


A 


357 


63 . 853 


43.982 


-14 .793 


1.00 


24 .48 


A 


N 


ATOM 


2744 


CA 


SER 


A 


357 


64 . 963 


44.919 


-14 .883 


1.00 


26.30 


A 


c 


ATOM 


2745 


C 


SER 


A 


357 


65.767 


44.633 


-16.142 


1.00 


26.78 


A 


c 


ATOM 


2746 


o 


SER 


A 


357 


66.071 


43.481 


-16.420 


1.00 


30.47 


A 


o 


ATOM 


2747 


CB 


SER 


A 


357 


65 . 896 


44 , 775 


-13 . 676 


1 .00 


25 . 63 


A 


c 


ATOM 


2748 


OG 


SER 


A 


357 


67 . 009 


45 , 645 


-13 .815 


1.00 


30. 40 


A 


o 


ATOM 


2749 


N 


ALA 


A 


358 


66. 128 


45 . 682 


-16.867 


1.00 


32 .11 


A 


N 


ATOM 


2750 


CA 


ALA 


A 


358 


67 .012 


45 . 567 


-18 . 029 


1 .00 


36,75 


A 


C 


ATOM 


2751 


C 


ALA 


A 


358 


68 .445 


45 . 147 


-17 . 666 


1 . 00 


38 . 55 


A 


C 


ATOM 


2752 


o 


ALA 


A 


358 


69 . 233 


44 . 838 


-18 .560 


1 .00 


42 .49 


A 


O 


ATOM 


2753 


CB 


ALA 


A 


358 


67 . 025 


46.881 


-18 .802 


1 .00 


37 . 17 


A 


C 


ATOM 


2754 


N 


CYS 


A 


359 


68 . 782 


45.129 


-16.374 


1.00 


39 . 61 


A 


N 


ATOM 


2755 


CA 


CYS 


A 


359 


70, 124 


44 .742 


-15. 920 


1.00 


41.87 


A 


C 


ATOM 


2756 


C 


CYS 


A 


359 


70. 169 


43.490 


-15.049 


1.00 


42.74 


A 


C 


ATOM 


2757 


o 


CYS 


A 


359 


71.241 


43.132 


-14.550 


1.00 


45. 60 


A 


o 


ATOM 


2758 


CB 


CYS 


A 


359 


70.801 


45.913 


-15.175 


1.00 


41. 64 


A 


c 


ATOM 


2759 


SG 


CYS 


A 


359 


70.275 


46.154 


-13.447 


1.00 


42. 44 


A 


S 


ATOM 


^ / OU 


N 


HIS 


A 


360 


69 . 040 


42 , 811 


-14 . 847 


1 . 00 


42 . 50 


A 


N 


ATOM 


2761 


CA 


HIS 


A 


360 


69.071 


41.569 


-14.081 


1.00 


43.24 


A 


C 


ATOM 


2762 


C 


HIS 


A 


360 


69.903 


40.538 


-14 .848 


1.00 


44.08 


A 


C 


ATOM 


2763 


0 


HIS 


A 


360 


69.932 


40.545 


-16.089 


1.00 


37.43 


A 


o 


ATOM 


2764 


CB 


HIS 


A 


360 


67.665 


41.037 


-13.772 


1.00 


43.88 


A 


C 


ATOM 


2765 


CG 


HIS 


A 


360 


67.018 


40.307 


-14.909 


1.00 


42.46 


A 


C 


ATOM 


2766 


NDl 


HIS 


A 


360 


66.587 


40.941 


-16.054 


1.00 


43,80 


A 


N 


ATOM 


2767 


CD2 


HIS 


A 


360 


66.711 


38.997 


-15.067 


1.00 


43.40 


A 


C 


ATOM 


2768 


CEl 


HIS 


A 


360 


66.053 


40,053 


-16.876 


1.00 


42.29 


A 


C 


ATOM 


2769 


NB2 


HIS 


A 


360 


66.107 


38.867 


-16.295 


1.00 


40.67 


A 


N 
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ATOH 


2770 


N 


VM 


A 


361 


70. 604 


39.688 


-14 .108 


1.00 


46.31 


A 


N 


ATOM 


2771 


OA 


VAL 


A 


361 


71 . 444 


38.671 


-14 .736 


1 ,00 


53.46 


A 


C 


ATOM 


2772 


C 


VAL 


A 


361 


70.569 


37.519 


-15.208 


1,00 


55.15 


A 


C 


ATOM 


2773 


0 


VAL 


A 


361 


69.788 


36.965 


-14.433 


1 ,00 


55.26 


A 


O 


ATOM 


2774 


CB 


VAL 


A 


361 


72 .584 


38.144 


-13.812 


1 .00 


55.33 


A 


C 


ATOM 


2775 


CGI 


VAL 


A 


361 


73.724 


39.146 


-13,769 


1 .00 


58.02 


A 


C 


ATOM 


2776 


CG2 


VAL 


A 


361 


72 .086 


37.824 


-12.392 


1 .00 


57 . 18 


A 


C 


ATOM 


2777 


N 


HIS 


A 


362 


70. 687 


37.191 


-16.491 


1 .00 


58 .23 


A 


N 


ATOM 


2778 


OA 


HIS 


A 


362 


69.957 


36.071 


-17 .078 


1 .00 


61.27 


A 


C 


ATOM 


2779 


C 


HIS 


A 


362 


70.886 


35.268 


-17.991 


1.00 


63.90 


A 


C 


ATOM 


2780 


0 


HIS 


A 


362 


72 .106 


35.470 


-17 .978 


1 .00 


63.06 


A 


0 


ATOM 


2781 


CB 


HIS 


A 


362 


68 .707 


36.570 


-17 .820 


1.00 


61 . 80 


A 


C 


ATOM 


2782 


CG 


HIS 


A 


362 


68 . 987 


37 . 603 


-18 .869 


1 .00 


64 .30 


A 


C 


ATOM 


2783 


NDl 


HIS 


A 


362 


69.075 


38.949 


-18.582 


1 .00 


65.54 


A 


N 


ATOM 


2784 


CD2 


HIS 


A 


3 62 


69.176 


37.491 


-20.206 


1.00 


66.55 


A 


C 


ATOM 


2785 


CEl 


HIS 


A 


3 62 


69.318 


39.621 


-19. 694 


1 .00 


66.26 


A 


C 


ATOM 


2786 


NE2 


HIS 


A 


362 


69.384 


38.760 


-20.694 


1.00 


67.04 


A 


N 


ATOM 


2787 


N 


ASP 


A 


363 


70.311 


34.348 


-18.765 


1.00 


66.38 


A 


N 


ATOM 


2788 


CA 


ASP 


A 


363 


71.086 


33.477 


-19.645 


1.00 


68.15 


A 


C 


ATOM 


2789 


C 


ASP 


A 


363 


70.180 


32.971 


-20.779 


1.00 


69.51 


A 


C 


ATOM 


2790 


O 


ASP 


A 


363 


69.558 


33.787 


-21.466 


1. 00 


68.87 


A 


O 


ATOM 


2791 


CB 


ASP 


A 


363 


71 .711 


32 .343 


-18 .820 


1 . 00 


67 . 85 


A 


C 


ATOM 


2792 


CG 


ASP 


A 


363 


70 . 722 


31 .702 


-17 .869 


1 , 00 


67 . 03 


A 


c 


ATOM 


2793 


ODl 


ASP 


A 


363 


71 . 157 


31 .015 


-16. 923 


1 . 00 


67 . 42 


A 


0 


ATOM 


2794 


0D2 


ASP 


A 


363 


69 .490 


31 .839 


-17 . 981 


1 . 00 


67 .25 


A 


0 


ATOM 


2795 


N 


GLU 


A 


364 


70 . Ill 


31.651 


-20. 981 


1 . 00 


71 ,44 


A 


N 


ATOM 


2796 


CA 


GLU 


A 


364 


69 .186 


31.037 


-21.944 


1.00 


71 . 30 


A 


C 


ATOM 


2797 


C 


GLU 


A 


364 


68 . 223 


30.026 


-21,289 


1.00 


69 .20 


A 


c 


ATOM 


2798 


0 


GLU 


A 


364 


67.280 


29.569 


-21 .938 


1.00 


70.14 


A 


0 


ATOM 


2799 


CB 


GLU 


A 


364 


69.980 


30.351 


-23.069 


1.00 


73 .26 


A 


c 


ATOM 


2800 


CG 


GLU 


A 


364 


69.968 


31.097 


-24 .399 


1.00 


74 .89 


A 


c 


ATOM 


2801 


CD 


GLU 


A 


364 


70. 651 


32.451 


-24 .320 


1.00 


76.70 


A 


c 


ATOM 


2802 


OEl 


GLU 


A 


364 


71 .868 


32.494 


-24 .028 


1 .00 


77 .66 


A 


o 


ATOM 


2803 


OE2 


GLU 


A 


364 


69. 969 


33.476 


-24 ,549 


1.00 


79.09 


A 


o 


ATOM 


2804 


N 


PHE 


A 


365 


68 .455 


29.685 


-20 .017 


1.00 


66. 67 


A 


N 


ATOM 


2805 


CA 


PHE 


A 


365 


67 . 630 


28 .704 


-19.299 


1.00 


64 .31 


A 


c 


ATOM 


2806 


C 


PHE 


A 


365 


66. 403 


29.347 


-18 . 633 


1.00 


61 .08 


A 


c 


ATOM 


2807 


O 


PHE 


A 


365 


65 . 266 


29.082 


-19. 026 


1.00 


62.40 


A 


o 


ATOM 


2808 


CB 


PHE 


A 


365 


68 . 4 64 


27 .959 


-18 .245 


1.00 


65.86 


A 


c 


ATC^ 


2809 


CG 


PHE 


A 


365 


69.365 


26.886 


-18 . 819 


1.00 


67 .96 


A 


c 


ATOM 


2810 


CDl 


PHE 


A 


365 


70 .557 


27.227 


-19.461 


1.00 


68 .71 


A 


c 


ATOM 


2811 


CD2 


PHE 


A 


365 


69.029 


25.538 


-18.705 


1.00 


68 .02 


A 


c 


ATOM 


2812 


CEl 


PHE 


A 


365 


71 .395 


26.241 


-19. 989 


1.00 


68.45 


A 


c 


ATOM 


2813 


CE2 


PHE 


A 


365 


69 . 860 


24 .545 


-19.232 


1.00 


68 .28 


A 


c 


ATOM 


2814 


CZ 


PHE 


A 


365 


71 . 045 


24 .899 


-19.874 


1.00 


68 .33 


A 


c 


ATOM 


2815 


N 


ARG 


A 


366 


66 . 636 


30. 183 


-17 . 624 


1.00 


54 .06 


A 


N 


ATOM 


2816 


CA 


ARG 


A 


366 


65 . 544 


30.804 


-16. 874 


1 ,00 


49 .84 


A 


C 


ATOM 


2817 


C 


ARG 


A 


366 


65 .747 


32.305 


-16.729 


1,00 


48 .18 


A 


C 


ATOM 


2818 


0 


T^G 


A 


366 


66 . 8 67 


32.816 


-16.857 


1.00 


47 .96 


A 


0 


ATOM 


2819 


CB 


ARG 


A 


366 


65 . 424 


30.159 


-15,490 


1.00 


47.56 


A 


c 


ATOM 


2820 


CG 


ARG 


A 


366 


65.240 


28.655 


-15.525 


1.00 


43.64 


A 


c 


ATOM 


2821 


CD 


ARG 


A 


366 


64 .974 


28.012 


-14 .174 


1.00 


38.98 


A 


c 


ATOM 


2822 


NE 


ARG 


A 


366 


66. 159 


27.987 


-13.327 


1.00 


41.26 


A 


N 


ATOM 


2823 


CZ 


ARG 


A 


366 


66.242 


27.371 


-12.147 


1.00 


36.86 


A 


c 


ATOM 


2824 


NHl 


ARG 


A 


366 


65 .203 


26.718 


-11.644 


1.00 


42.11 


A 


N 


ATOM 


2825 


NH2 


ARG 


A 


366 


67 . 375 


27.413 


-11.471 


1.00 


36.33 


A 


M 


ATOM 


2826 


N 


THR 


A 


3 67 


64 . 654 


33.009 


-16.446 


1.00 


46.71 


A 


N 


ATOM 


2827 


CA 


THR 


A 


367 


64 . 675 


34 .468 


-16.397 


1.00 


4 6. 92 


A 


C 


ATOM 


2828 


C 


THR 


A 


367 


63 .746 


35.043 


-15 .331 


1.00 


44 . 72 


A 


C 


ATOM 


2829 


0 


THR 


A 


367 


62 . 744 


34 . 435 


-14 . 973 


1 . 00 


42 . 54 


A 


O 


ATOM 


2830 


CB 


THR 


A 


367 


64 . 306 


35 .019 


-17 .783 


1 . 00 


4 6. 57 


A 


C 


ATOM 


2831 


OGl 


THR 


A 


367 


65 . 143 


34 . 406 


-18 .774 


1 . 00 


49 . 10 


A 


0 


ATOM 


2832 


CG2 


THR 


A 


367 


64 . 628 


36 . 484 


-17 . 902 


1 . 00 


48 , 33 


A 


C 


ATOM 


2833 


N 


ALA 


A 


368 


64 . 105 


36.219 


-14 .819 


1.00 


43. 12 


A 


N 


ATOM 


2834 


CA 


ALA 


A 


368 


63 . 206 


36, 976 


-13 . 962 


1.00 


37 . 88 


A 


C 


ATOM 


2835 


C 


ALA 


A 


368 


61 . 990 


37 .417 


-14 .772 


1.00 


31. 67 


A 


C 


ATOM 


2836 


0 


ALA 


A 


368 


62 . 091 


37 ,780 


-15.946 


1.00 


33.05 


A 


O 




2837 


CB 


ALA 




368 


63 . 920 
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A 


C 


ATOM 


2838 


■ N 


ALA 


A 


369 


60.827 


37.407 


-14.133 


1.00 


30.27 


A 


N 


ATOM 


2839 


CA 


ALA 


A 


369 


59.608 


37.776 


-14 .828 


1.00 


25,65 


A 


C 


ATOM 


2840 


C 


ALA 


A 


369 


58.590 


38.454 


-13.917 


1.00 


18.85 


A 


C 


ATOM 


2841 


0 


ALA 


A 


369 


58.574 


38.264 


-12.707 


1.00 


26.84 


A 


0 


ATOM 


2842 


CB 


ALA 


A 


369 


58.988 


36.559 


-15.484 


1.00 


26.18 


A 


C 


ATOM 


2843 


N 


VAL 


A 


370 


57.772 


39.267 


-14.543 


1.00 


21.79 


A 




ATOM 


2844 


CA 


VAL 


A 


370 


56.623 


39.891 


-13.921 


1.00 


22.63 


A 


C 


ATOM 


2845 


C 


VAL 


A 


370 


55.460 


39.580 


-14.864 


1.00 


23.34 


A 


C 


ATOM 


2846 


0 


VAL 


A 


370 


55.491 


39.970 


-16.007 


1.00 


23.37 


A 


o 
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ATOM 


2847 


CB 


VAL 


A 


370 


56.806 


41.403 


-13.783 


1 .00 


24 .21 


A 


C 


ATOM 


2848 


CGI 


VAL 


A 


370 


55 . 606 


42.016 


-13.069 


1 .00 


20 . 64 


A 


C 


ATOM 


2849 


CG2 


VAL 


A 


370 


58 . 091 


41 ,738 


-13 . 021 


1 . 00 


24 . 95 


A 


c 


ATOM 


2850 


N 


GLU 


A 


371 


54 . 435 


38.890 


-14 .367 


1 . 00 


23 .45 


A 


N 


ATOM 


2851 


CA 


GLU 


A 


371 


53.364 


38.364 


-15 . 208 


1 . 00 


26. 07 


A 


C 


ATOM 


2852 


C 


GLU 


A 


371 


52.005 


38. 601 


-14 . 556 


1.00 


21 . 52 


A 


c 


ATOM 


2853 


O 


GLU 


A 


371 


51.886 


38.584 


-13.346 


1.00 


22.32 


A 


o 


ATOM 


2854 


CB 


GLU 


A 


371 


53.593 


36.859 


-15.452 


1.00 


29.04 


A 


c 




2855 


CG 


GLU 


A 


371 


54.667 


36.583 


-16.508 


1.00 


37.79 


A 


c 


MGU 


2856 


CD 


GLU 


A 


371 


55.383 


35.235 


-16.373 


1.00 


42 . 98 


A 


c 


ATOM 


2857 


OEl 


GLU 


A 


371 


55.957 


34.768 


-17.389 


1.00 


47 . 96 


A 


o 


ATOM 


2858 


0E2 


GLO 


A 


371 


55. 428 


34.655 


-15.271 


1.00 


46. 63 


A 


o 


ATOM 


2859 


N 


GLY 


A 


372 


50. 997 


38.849 


-15.375 


1 . 00 


23 .24 


A 


N 


ATOM 


2860 


CA 


GLY 


A 


372 


49. 629 


38.995 


-14 .902 


1 . 00 


20. 28 


A 


c 


ATOM 


2861 


C 


GLY 


A 


372 


48 . 652 


39,097 


-16,060 


1 . 00 


21. 84 


A 


c 


ATOM 


2862 


0 


GLY 


A 


372 


49. 087 


39,076 


-17 .231 


1.00 


23. 46 


A 


o 


ATOM 


2863 


N 


PRO 


A 


373 


47 .355 


39.219 


-15 .790 


1.00 


20. 69 


A 


N 


ATOM 


2864 


CA 


FRO 


A 


373 


46.758 


39,117 


-14 .455 


1.00 


18 . 48 


A 


C 


ATOM 


2865 


C 


PRO 


A 


373 


46. 427 


37. 686 


-14.050 


1.00 


19.08 


A 


C 


ATOM 


2866 


O 


PRO 


A 


373 


46.197 


36.821 


-14.906 


1 .00 


22. 98 


A 


o 


ATOM 


2867 


CB 


PRO 


A 


373 


45. 466 


39. 923 


-14 . 607 


1 .00 


20. 66 


A 


C 


ATOM 


2868 


CG 


PRO 


A 


373 


45.050 


39. 692 


-16.023 


1.00 


22 . 14 


A 


c 


ATOM 


2869 


CD 


PRO 


A 


373 


46-328 


39,516 


-16.811 


1.00 


21.78 


A 


c 


ATOM 


2870 


N 


PHE 


A 


374 


46. 385 


37 . 445 


-12 .742 


1 . 00 


15 . 75 


A 


N 


ATOM 


2871 


CA 


PHE 


A 


374 


45 . 882 


36.206 


-12 , 195 


1 . 00 


16. 86 


A 


C 


ATOM 


2872 


C 


PHE 


A 


374 


44 . 622 


36.442 


-11.376 


1. 00 


21, 99 


A 


C 


ATOM 


2873 


O 


PHE 


A 


374 


44 .370 


37.554 


-10.943 


1.00 


19. 16 


A 


0 


ATOM 


2874 


CB 


PHE 


A 


374 


46. 973 


35. 539 


-11 .360 


1.00 


16. 65 


A 


C 


ATOM 


2875 


CG 


PHE 


A 


374 


48. 148 


35. 118 


-12 . 170 


1.00 


19. 99 


A 


C 


ATOM 


2876 


CDl 


PHE 


A 


374 


48 . 097 


33 . 957 


-12 . 936 


1 .00 


23.38 


A 


C 


ATOM 


2877 


CD2 


PHE 


A 


374 


49.294 


35.894 


-12.221 


1 .00 


22. 80 


A 


C 


ATOM 


2878 


CEl 


PHE 


A 


374 


49.189 


33.565 


-13.711 


1 .00 


27 .21 


A 


c 


ATOM 


2879 


CE2 


PHE 


A 


374 


50 . 385 


35.503 


-12 . 980 


1 .00 


22 .14 


A 


c 


ATOM 


2880 


CZ 


PHE 


A 


374 


50. 341 


34 .340 


-13 .722 


1.00 


25 . 11 


A 


c 


ATOM 


2881 


N 


VAL 


A 


375 


43 . 822 


35 .392 


-11 , 207 


1 .00 


21.84 


A 


N 


ATOM 


2882 


CA 


VAL 


A 


375 


42 . 614 


35 .458 


-10 . 407 


1.00 


22 .25 


A 


C 


ATOM 


2883 


C 


VAL 


A 


375 


42 . 948 


34.876 


-9 . 049 


1 .00 


20. 48 


A 


c 


ATOM 


2884 


0 


VAL 


A 


375 


43 . 281 


33. 695 


-8 . 943 


1 .00 


24.24 


A 


o 


ATOM 


2885 


CB 


VAL 


A 


375 


41 . 439 


34 . 656 


-11 .016 


1 . 00 


24 . 98 


A 


c 


ATOM 


2886 


CGI 


VAL 


A 


375 


40 . 206 


34 .758 


-10 .119 


1 . 00 


22 . 92 


A 


c 


ATOM 


2887 


CG2 


VAL 


A 


375 


41. 117 


35.152 


-12.404 


1.00 


23. 43 


A 


c 


ATOM 


2888 


N 


THR 


A 


376 


42.881 


35.710 


-8-023 


1.00 


22.65 


A 


N 


ATOM 


2889 


CA 


THR 


A 


376 
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48.453 40.727 -19.960 1.00 

66.664 48.548 5.951 1.00 

58.083 43.778 -17.062 1.00 

55.799 60.814 5.110 1.00 

79.293 52.119 13.860 1.00 

77.511 45.900 20.280 1.00 

50.802 43.439 -20.117 1.00 

66.106 19.960 -9.172 1.00 

63.894 58.910 -19.204 1.00 

76.257 41.684 15.651 1.00 

54.819 50.279 -18.015 1.00 

65.401 64.403 6.138 1.00 

53.853 55.150 -11.636 1.00 
68.908 67.519 -5.703 1.00 
79.968 52.673 6.743 1.00 
48.181 44.979 -10.637 1-00 
53.488 60.669 -0.029 1.00 
62.724 61.887 9.306 1.00 
64.870 59.282 19.837 1.00 
67.034 55.997 8.478 1.00 
81.783 69.009 13.884 1.00 
62.338 60.129 2.848 1.00 
59.948 49.626 3.509 1.00 
74.315 61.973 -6.807 1.00 
72.754 44.483 0.023 1.00 
85.756 65.674 6.462 1.00 
65.197 62.897 8.395 1,00 
83.185 55.955 4.621 1.00 
68.666 31.435 6.797 1.00 
70.959 50.115 -0.021 1,00 
70.634 69.168 18.081 1.00 
83.133 65.815 2.329 1.00 
81.369 47.920 15.072 1.00 
87.299 59.567 9.845 1.00 

41.854 32.167 -5.319 1.00 
87.742 64.125 6.529 1.00 
72.460 68.092 12.019 1.00 
65.274 42,384 -19.635 1.00 
85.768 65.313 2.708 1.00 
62.071 26.325 -12.323 1,00 
53.548 58.246 7.753 1.00 
48.415 35.384 -17.283 1.00 
63.389 66.452 6.071 1.00 
82.811 58.045 -3.976 1.00 
73.849 44.456 -1.977 1.00 
45.102 52.297 -10.384 1.00 
65.497 47.590 -7.949 1.00 
60.385 50.571 -20.969 1.00 
73.977 51.153 -13.532 1.00 
73.807 75.017 -0.696 1,00 
89.302 56.875 9.021 1.00 
59.573 59.896 2.947 1.00 
69.343 40.980 6.123 1.00 
52.716 58.960 -10.022 1.00 
71.368 68.265 20.363 1.00 
58.025 24.259 10.874 1,00 
79.324 57.854 -5.249 1.00 
52.049 42.888 4.777 1.00 
58.572 51.240 -21.845 1.00 
58.399 59.801 -15.372 1.00 
51.199 63,163 -3.700 1.00 
39.751 42.093 5.333 1.00 
62.377 69.523 12.319 1.00 
57.972 57.007 14.799 1.00 
62.896 33.477 -11.943 1.00 
77.078 56.466 -5.817 1.00 
58.723 72.174 10.770 1.00 
82.563 53.786 6.291 1.00 
59.353 71.034 3.910 1.00 
64.748 30.333 -21.491 1.00 
74.634 59.328 -12.866 1.00 
55.438 42.543 -19.877 1.00 
77.532 77,780 -0.830 1.00 
65.148 68.989 -11.545 1.00 
57.778 41.274 -18.333 1.00 
55.086 59.049 16.334 1.00 
81.228 50.040 13.406 1.00 
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71.098 
55.859 
48.282 
46.171 
50.769 
69.639 
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37.484 
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31.897 
76.127 
59.460 
64.021 
66.667 
75.297 
46.059 
53.491 
57.368 
49.754 
66.881 
34.659 
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75.620 
63.907 
68.239 
63.896 
64.107 
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68.945 
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32.460 
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45.846 
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69.504 
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-8.009 
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-7.695 
1.815 
14.878 
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-3.643 
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-8.023 
-2.622 
-15.833 
-5.405 
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-5.072 
-7.677 
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-8.479 
5.802 
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-11.764 
-5,408 
-2.621 
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-15.255 
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3.636 
7.690 
8.251 
-2.546 
10.615 
-10.561 
-1.573 
15.946 
9.428 
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-2.607 
7.564 
15.948 
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00 45.35 
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00 52.07 
00 31.75 
00 24.22 
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1 
1 
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1.00 33 
1.00 33 
1.00 51.23 
1.00 56.51 
1.00 36.81 
1.00 58.01 
1.00 63.77 
1.00 55.75 
1.00 73.57 
1.00 45.91 
1.00 50.99 
1.00 67.69 
00 33.48 
00 42.03 
00 43.32 
00 30.64 
00 39.92 
1.00 52.71 
1.00 35.96 
00 45.54 
00 37.40 
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00 45.89 
00 38,39 
00 35.36 
1.00 26.82 
1.00 28.56 
1.00 45.91 
1.00 38.29 
1.00 68.20 
1.00 67.09 
1.00 50.12 
1.00 45.17 
1.00 36.02 
1.00 70.84 
1.00 39.36 
1.00 56.31 
1.00 51.35 
1.00 57.63 
1.00 52.43 
1.00 43.55 
1.00 53.68 
1.00 56.75 
1.00 44.26 
1.00 31.34 
1.00 73.50 
1,00 46.42 
1.00 57.59 
1.00 56.49 
1.00 62.98 
1.00 63.14 
1.00 38.50 
1.00 40.86 
1.00 61.27 
1.00 63.55 
1.00 58.76 
1.00 32.19 
1.00 38.10 
1.00 56.30 
1,00 68.96 
1.00 67.43 
1.00 72.28 
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ATOM 3155 0 HOH W 189 

ATOM 3156 0 HOH W 190 

ATOM 3157 O HOH W 191 

ATOM 3158 O HOH W 192 

ATOM 3159 O HOH W 193 

ATOM 3160 O HOH W 194 

ATOM 3161 O HOH W 195 

ATOM 3162 O HOH W 196 

ATOM 3163 O HOH W 197 

ATOM 3164 O HOH W 198 
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71.187 76.958 15.269 

73.886 47.482 4,081 

69.355 68.996 -15.162 

82.777 65.787 -8.682 

39.736 46.583 7.480 

52.055 40.044 -22.266 

71.314 50.785 -16.556 

61.950 38.755 -19.713 

84.051 69.275 5.460 

76.032 60.681 20.880 

73.266 44.918 4.326 

82.129 50.468 -5.451 

83.221 72.917 3.600 

59.652 75.257 4.275 
78.123 47.635 22.706 

77.637 76.375 11.568 
58.555 48.938 13.305 

57.638 66.927 18.153 
58.312 43.498 7.697 
44.538 28.297 3.536 
59.595 53.833 19.308 
57.084 51.317 14.707 
49.436 21.830 -1.938 
60.734 77.657 4.018 
79.123 83.308 3.898 
57.523 61.921 -13.519 
71.168 43.072 5.167 

76.653 84.242 3.301 
42.382 40.135 17.622 
78.733 69.517 -5.343 
62.986 22.749 -4.555 
60.743 44.247 9.220 
57.413 29.275 -13.554 
71.784 39.808 -3.358 
74.571 63.700 -13.618 
71.261 51.431 -13.741 
78.559 79.217 0.998 
68.431 42.241 17.534 
74.858 56.378 23.475 
79.307 60.745 22,219 
60.314 68.573 10.249 
61.602 71.621 -9.518 
49.899 42.585 7.057 
46.590 57.769 2.535 
45.044 34.173 -1.541 
71.447 46.668 -18.182 
73.000 43.214 -18,003 
43.370 55.663 -1.011 
74,007 57.458 -17.330 
78.277 52.906 -16.612 
77.796 59.191 -8.755 
84.436 60.164 -3.135 
65.112 49.259 9.447 
63.207 51.425 10.118 
89.242 51.621 10.559 

88.861 58.033 -1.500 
80.840 77.800 12.517 
77.216 83.653 0.754 
69.579 67.222 23.238 
75.887 51.320 21.816 
68.191 78.916 4.291 
82.004 63.181 21.579 
76.390 67.886 21.910 
53.503 60.921 17.416 
60.509 46.370 -23.693 
53.842 41.622 -18.205 
48.037 45.876 -0.170 
44.592 45.050 2.573 
40.130 44.608 4.624 
69.355 47.143 5.898 
34.957 32.570 1.397 
61.555 31.492 -14.640 

43.862 53.451 -5.566 
84.234 48.309 0.364 
87.932 51.816 -3.215 
82.425 63.456 -6.283 
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CLAIMS 

1. A BACE protein, which comprises the sequence set out in residues 45 to 455 of 
SEQ ED NO:2 (43 to 453 SwissProt P56817), or a fragment thereof comprising . 
residues corresponding to 58 to 398 of SEQ ID NO:2, modified by the following 
changes: • 

(a) substitution or deletion of at least one residue which is a proteolytic cleavage 
site recognised by clostripain; and optionally 

(b) the replacement of from 1 to 30 other amino acids by an equivalent or fewer . 
number of amino acids. 

2. A protein according to claim 1 wherein at least one of residues 44, 47, 57, 58 and 59 
of SEQ ID NO:2 are substituted. 

3. A protein according to claim 1 or 2 wherein residues 58 and/or 59 are lysine. 

4. A protein according to any one of the preceding claims wherein the asparagine 
residues at positions 155, 174, 225 and 356 (SwissProt P56817 153, 172, 223 and 
354) are replaced by glutamine residues. 

5. A protein according to any one of the preceding claims wherein the fragment is 
truncated at the C-terminus such that at least residues 449 et seq. of SEQ ID N0:2 
are absent. 

6. A method of making a truncated BACE protein, which method comprises 
proteolytically cleaving the protein of any one of the preceding claims. 

7. The method of claim 6 wherein said-cleavage is at and including one or more of 
residues 44, 47, 57, 58 and 59. 

8. A BACE protein obtained or obtainable by the method of claim 7. 

9. A protein according to claim 8 wherein the N-terminal is residue 45 of SEQ ID 
N0:2. 

10. A protein according to claim 1 which is selected from: (a) SEQ ID 6; (b) SEQ ID 8; 

(c) SEQ ID 10; (d) SEQ ID 12; (e) SEQ ID 14; (f> SEQ ID 16; (g) SEQ ID 18; or a 
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protein according to claim 7 which is selected from (h) SEQ ID 19; (i) SEQ ID 20; 
G)SEQID2L 

11. Nucleic acid encoding the protein of any one of claims 1 to 5 or 8 to 10. 

12. A vector comprising the nucleic acid of claim 11. 

13. A host cell comprising the vector of claim 12. 

• 14. A process for producing the protein of any one of claims 1 to 5 or 8 to 10 

comprising the steps of: (a) culturing the host cell of claim 13 under conditions 
suitable for expression of the protein; and optionally (b) isolating the expressed 
recombinant BACE protein. 

15. A process for producing refolded recombinant BACE protein comprising the steps 
of: (a) solubilising the recombinant BACE; (b) diluting the solubilised BACE into 
an aqueous buffer containing 10 to 50 mM sulfobetaine; and (c) maintaining the 
diluted solution at low temperature and at high pH for at least 2 weeks. 

1 6. A process for producing a crystal of BACE comprising the step of refolding 
recombinant BACE protein according to the process of claim 14. 

17. A process for producing a crystal of a BACE protein comprising the step of growing 
the crystal by vapour diffusion using a reservoir buffer that contains 1 8-26 % PEG 
5000 MME, 180-220 mM ammonium iodide and 180-220 mM tri-sodium citrate pH 
6.4-6.6, and optionally 0-5% glycerol. 

18. A process according to any one of claims 15 to 17 wherein the BACE protein is 
human BACE. 

19. A process according to any one of claims 15 to 17 wherein the BACE protein is as 
defined in any one of claims 1 to 5 or 8 to 1 0. 

20. A crystal of a BACE protein having a hexagonal space group P6i22. 

21. The crystal of claim 20 having unit cell dimensions of a=b=103.2 A, c=169.1 A, 
a=P=60®, 7=120*", and a unit cell variability of 5% in all dimensions. 
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22. A crystal of a BACE protein comprising a structure defined by all or a portion of the 
co-ordinates of Table 1 + a root mean square deviation from the Ca atoms of less 
than 0.5A. 

23. A crystal of the protein of any one of clauns 1 to 5 or 8 to 10. 

24. The crystal of any one of claims 20 to 23 having a resolution better than 2.5 A. 

25. The crystal of any one of claims 20 to 24 which is soaked with one or more 
compound(s) to form co-complex stmctures. 

26. The crystal of any one of claims 20 to 24 wherein the BACE is co-crystallized with 
one or more compound(s) to form co-crystallized stmctures. 

27. The crystal of any one of claims 20 to 24 which is an apo crystal. 

28. A computer-based method for the analysis of the interaction of a molecular stmcture 
with a BACE protein, which comprises: 

(a) providing a structure comprising a three-dimensional representation of 
BACE or of a portion of BACE, which representation comprises all or a portion 
of the coordinates of Table 1 ± a root mean square deviation from the Ca atoms 
of less than 0.5 A; 

(b) providing a molecular structure to be fitted to said BACE structure; and 

(c) fitting the molecular structure to the BACE structure of (a). 

29. The method of claim 28 wherein the molecular structure to be fitted is in the form of 
a model of a pharmacophore. 

30. The method of claim 28 or 29 wherein the three-dimensional representation is a 
model constructed from all or a portion of the coordinates of Table 1 + a root mean 
square deviation from the Ca atoms of less than 0.5A, 

3 1 . The method of claim 30 wherein the model is: (a) a wire-frame model; (b) a 
chicken-wire model; (c) a ball-and-stick model; (d) a space-filling model; (e) z 
stick-model; (f) a ribbon model; (g) a snake model; (h) an arrow and cylinder 
model; (i) an electron density map; (j) a molecular surface model. 
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32. A computer-based method for the analysis of molecular structures which comprises: 

(a) providing the coordinates of at least two atoms of a BACE structure as 
defined in Table 1 + a root mean square deviation jfrom the Ca atoms of less than 
0.5A ("selected coordinates"); 

(b) providing the structure of a molecular structure to be fitted to the selected 
coordinates; and 

(c) fitting the structure to the selected coordinates of the BACE structure. 

33. The method of claim 32 wherein the selected coordinates are of at least 5, 10, 50, 
100 or 500 atoms. 

34. The method of any one of claims 28 to 33 wherein the coordinates of Table 1 
' represent a binding pocket. 

35. The method of claim 34 wherein the coordinates of Table 1 comprise those relating 
to residues SER71, GLY72, LEU91, ASP93, GLY95, SER96, VAL130, PR0131, 
TYR132, THR133, GLN134, ILE171, ILE179, ILE187, ALA188, ARG189, 
PRO190, TRP258, TYR259, ASP284, LYS285, ASP289, GLY291, THR292, 
THR293, ASN294, ARG296 and ARG368 (based on the numbering of SwissProt 
P56817). 

36. A computer-based method of rational drug design comprising the method of any one 
of claims 28 to 35. 

37. A computer-based method of rational drug design comprising comprising: 

(a) providing the coordinates of at least two atoms of a BACE stmcture as 
defined in Table 1 ± a root mean square deviation from the Ca atoms of less than 
0.5A ("selected coordinates"); 

(b) providing the structures of a plurality of molecular fragments; 

(c) fitting the structure of each of the molecular fragments to the selected 
coordinates; and 
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(d) assembling the molecular fragments into a single molecule to fomi a 
candidate modxilator molecxile. 

38. . A method for identifying a candidate modulator of BACE comprising the steps of: 

(a) employing a three-dimensional structure of BACE, at least one sub-domain 
thereof, or a plurality of atoms thereof, to characterise at least one BACE 
binding cavity, the three-dimensional structure being defined by atomic 
coordinate data according to Table 1 ± a root mean square deviation from the Ca 
atoms of less than 0,5 A; and 

(b) identifying the candidate modulator by designing or selecting a compound 
for interaction with the binding cavity, 

39. The method of any one of claims 28 to 38 further comprising the step of: 

(a) obtaining or synthesising the molecular structure or modulator; and 

(b) contacting the molecular structure or modulator with BACE to determine the 
ability of the molecular structure or modulator to interact with BACE. 

40. A method of assessing the ability of a candidate modulator to interact with BACE 
which comprises the steps of: 

(a) obtaining or synthesising said candidate modulator; 

(b) forming a crystallized complex of a BACE protein of claims 1 to 5 or 8 to 10 
and said candidate modulator; and 

(c) analysing said complex by X-ray crystallography or NMR spectroscopy to 
determine the ability of said candidate modulator to interact with BACE. 

41 . A method for determining the structure of a compound bound to BACE, said 
method comprising: 

(a) mixing BACE with the compound to form a BACE-compound complex; 

(b) crystallizing the BACE-compound complex; and 
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(c) deteraiimng the structure of said BACE-compound(s) complex by reference 
to the data of Table 1 ± a root mean square deviation from the Ca atoms of less 
than 0.5A. 

42. A method for determining the structure of a compound bound to BACE, said 
method comprising: . 

(a) providing a crystal of BACE; 

(b) soaking the crystal with one or more compound(s) to form a complex; and 

(c) determining the structure of the complex by employing the data of Table 1 ± 
a root mean square deviation from the Ca atoms of less than 0.5 A. 

43. A method of determining the three dimensional structure of a BACE homologue or 
analogue of unknown structure, the method comprising the steps of: 

(a) aligning a representation of an amino acid sequence of the BACE homologue 
or analogue with the amino acid sequence of the BACE of Table 1 ± a root mean 
square deviation from the Ca atoms of less than 0.5 A to match homologous 
regions of the amino acid sequences; 

(b) modelling the structure of the matched homologous regions of said target 
BACE of unknown structure on the corresponding regions of the BACE 
structure as defined by Table 1 + a root mean square deviation from the Ca 
atoms of less than 0.5 A; and 

(c) determining a conformation for the BACE homologue or analogue which 
substantially preserves the structure of said matched homologotis regions. 

44. A method of providing data for generating structures and/or performing rational 
drug design for BACE, BACE homologues or analogues, complexes of BACE with 
a potential modulator, or complexes of BACE homologues or analogues with 
potential modulators, the method comprising (i) establishing communication with a 
iremote device containing computer-readable data comprising at least one of: 

(a) atomic coordinate data according to Table 1 ± a root mean square deviation 
from the Ca atoms of less than 0.5A, said data defining the three-dimensional 
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structure of B ACE, at least one sub-domain of the three-dimensional structure of 
BACE, or the coordinates of a plurality of atoms of B ACE; 

(b) structure factor data for BACE, said structure factor data being derivable 
from the atomic coordinate data of Table 1 ± a root mean square deviation from 
the Ca atoms of less than 0.5A; 

(c) atomic coordinate data of a target BACE homologue or analogue generated 
by homology modelling of the target based on the data of Table 1 + a root mean 
square deviation from the Ca atoms of less than 0.5A; , 

(d) atomic coordinate data of a protein generated by interpreting X-ray 
crystallographic data or NMR data by reference to the data of Table 1 ± a root 
mea£i sqxiare deviation from the Ca atoms of less than 0.5 A; and 

(e) structure factor data derivable from the atomic coordinate data of (c) or (d); 
and 

(ii) receiving said computer-readable data from said remote device. 
45. A computer system containing one or more of: 

(a) atomic coordinate data according to Table 1 ± a root mean square deviation 
from the Ca atoms of less than 0.5 A, said data defining the three-dimensional 
structure of BACE or at least selected coordinates thereof; 

(b) structure factor data (where a stmcture factor comprises the amplitude and 
phase of the diffracted wave) for BACE, said structure factor data being 
derivable from the atomic coordinate data of Table 1 ± a root mean square 
deviation from the Ca atoms of less than 0.5A; 

(c) atomic coordinate data of a target BACE protein generated by homology 
modelling of the target based on the data of Table 1 ± a root mean square 
deviation from the Ca atoms of less than 0.5A; 

(d) atomic coordinate data of a target BACE protein generated by interpreting X 
ray crystallographic data or NMR data by reference to the data of Table 1 ± a 
root mean square deviation from the Ca atoms of less than 0.5 A; or 
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(e) structure factor data derivable from the atomic coordinate data of (c) or (d). 

46. The computer system of claim 45 comprising: a computer-readable data storage 
medixmi comprising data storage material encoded with the computer-readable data; 

(a) a Working memory for storing instructions for processing said computer- 
readable data; and 

(b) a central-processing unit coupled to said working memory and to said 
computei:-readable data storage medium for processing said computer-readable 
data and thereby generating structures and/or performing rational drug design, 

47. A method for determining the structure of a protein, which method comprises; 
providing the co-ordinates of Table 1 ± a root mean square deviation from the Ca 
atoms of less than 0.5A, and either 

(a) positioning the co-ordinates in the crystal xmit cell of said protein so as to 
provide a structure.for said protein or 

(b) assigning NMR spectra Peaks of said protein by manipulating the 
coordinates of Table 1 ± a root mean square deviation from the Ca atoms of less 
than 0.5A. 

48. A method of preparing a composition comprising identifying a molecular structure 
or modulator according to the method of any one of claims 28 to 40, and admixing 
the molecule with a carrier. 

49. A process for producing a medicament, pharmaceutical composition or drug, the 
process comprising: (a) identifying a molecular structure or modulator according to 
the method as defined in any one of claims 28 to 40; and (b) preparing a 
medicament, pharmaceutical composition or drug containing the optimised 
modulator molecule. 

50. A process according to claim 49 which comprises (a) identifying a molecular 
structure or modulator according to the method as defined in any one of claims 28 to 
40; (b) optimising the structure of the modulator molecule; and (c) preparing a 
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medicament, pharmaceutical composition or drug containing the optimised 
modulator molecule. 

51. A compound identified, produced or obtainable by the process or method of any one 
ofclaims28 to40. 

52. A compound of claim 5 1 or composition thereof for use in medicine. 
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FIGURE 2 
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SEQ ID 1 : shows the DNA sequence coding for the BACE protein, BACH WT. 

ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCCATGGCGGGAGTGCTGCCT 
GCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAG 
ATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAAC 
TTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCC 
AGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAA 
GGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCC 
AACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGC 
ATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTT 
GACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCT 
GGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGA 
GGTATCGACCACTCGCTGTACACAGGCAGTCirCTGGTATACACCCATCCGGCGGGAGTGG 
TATTATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGC 
AAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCC 
AAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTC 
CCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGG 
AACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGC 
ATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGAC 
GACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATC 
ATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGC 
GCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTG 
GACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCATAA 

SEQ ID 2: shows the deduced amino acid sequence for BACE WT. 

MASMTGGQQMGRGSMAGVLPAHGTQHGIRLPLRSGLGGAPLGLEILPRETDEEPEEPGRRGSFVEMVDNLRGKSG 
QGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLV 
SIPHGPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGF 
PLNQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTN 
LRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLR 
PVEDVAT3QDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMED 
CGYNIPQTDES 

SEQ ID 3 : shows the DNA sequence coding for the BACE protein, BAGE N->Q, 

ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCCATGGGGGGAGTGCTGCCT 
GCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGC 
AGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAG 
ATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAAC 
TTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCC 
AGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAA 
GGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCCAGGTCACTGTGCGTGCC 
AACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCCAGGGCTCCAACTGGGAAGGC 
ATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTT 
GACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCT- 
GGCTTCCCCCTCCAGCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGA 
GGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGG 
TATTATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGC 
AAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCC 
AAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTC 
CCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGG 
AACATTTTCCCAGTQATCTCACTCTACCTAATGGGTGAGGTTACCCAGCAGTCCTTCCGC 
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ATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGAC 
GACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGGTGTTATC 
ATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGC 
GCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTG 
GACATGGAAGACTGTGGCTACAACATTCCACA.GACAGATGAGTCACATCACCATCATCAC 
CACTAA 

SEQ ID 4: shows the deduced amino acid sequence for BACE N->Q. 



masmtggqqmgrgsmagvlpahgtqhgirlplrsglggaplglrlpretdeepeepgrrgsfvemvdnlrgksg 
qgyyvemtvgsppqtlnilvdtgssnfavcjaaphpflhryyqrqlsstyrdlrkgvyvpytOgkwegelgtdlv 

siphgpqvtvraniaaitesdkffiqgsnwegilglayaeiarpddslepffdslvkqthvpnlfslqlcgagf 
plqqsevlasvggsmiiggidhslytgslwytpirrewyyeviivrveingqdlkmdckeynydksivdsgttn 
lrlpkkvfeaavksikaasstekfpdgfwlgeqlvcwqagttpwnifpvislylmgevtqqsfritilpqqylr 
pvedvatsqddcykfaisqsstgtvmgavimegfywfdrarkrigfavsachvhdefrtaavegpfvtldmed 
cgyni pqtdeshhhhhh 

SEQ ID 5: shows the DNA sequence coding for the BACE WT R56KR57K. 



ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCCATGGCGGGAGTGCTGCCT 
GCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 

ctggggctgcggctgccccgggagaccgacgaagagcccgaggagcccggcaagaagggc 

agctttgtggagatggtggacaacctgaggggcaagtcggggcagggctactacgtggag . 

atgaccgtgggcagccccccgcagacgctcaacatcctggtggatacaggcagcagtaac 

tttgcagtgggtgctgccccccaccccttcctgcatcgctactaccagaggcagctgtcc 

agcacataccgggacctccggaagggtgtgtatgtgccctacacccagggcaagtgggaa 

ggggagctgggcaccgacctggtaagcatcccccatggccccaacgtcactgtgcgtgcc 

aacattgctgccatcactgaatcagacaagttcttcatcaacggctccaactgggaaggc 

atcctggggctggcctatgctgagattgccaggcctgacgactccctggagcctttcttt 

GACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCT 
GGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGA 
GGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGG 
TATTATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAT^AATGGACTGC 
AAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCC 
AAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTC 
CCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGG 
AACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGC 
ATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGAC 
GACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATC 
ATGGAGGGCTTCTACGTTGTCTTTGATGGGGCCCGAAAACGAATTGGCTTTGCTGTCAGC 
GCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTG 
GACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCATAA 

SEQ ID 6: shows the deduced amino acid sequence for BACE WT R56KR57K 



masmtggqqmgrgsmagvlpahgtqhgirlplrsglggaplglrlpretdeepeepgkkgsfvemvdnlrgksg 
qgyyvemtvgsppqtlnilvdtgssnfavgaaphpfluryyqrqlsstyrdlrkgvyvpytqgkwegelgtdlv 
sxphgpnvtvraniaaitesdkffingsnwegilglayaeiarpddslepffdslvkqthvpnlfslqlcgagf 
plnqsevlasvggsmiiggidhslytgslwytpirrewyyeviivrveingqdlkmdckeynydksivdsgttn 
lrlpkkvfeaavksikaasstekfpdgfwlgeqlvcwqagttpwnifpvislylmgevtnqsfritilpqqylr 
pvedvatsqddcykfaisqsstgtvmgavimegfywfdrarkrigfavsachvhdefrtaavegpfvtldmed 
cgynipqtdes 

SEQ ID 7: shows the DNA sequence coding for the BACE WT R57K. 

ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCCATGGCGGGAGTGCTGCCT 
GCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAAGGGC 
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AGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAG 
ATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAAC 
TTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCC 
AGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAA 
GGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCC 
AACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGC 
ATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTT 
GACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCT 
GGCTTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGA 
GGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGG 
TATTATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGC 
AAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCC 
AAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTC 
CCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGG 
AACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGC 
ATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGAC 
GACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATC 
ATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGC 
GCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTG 
GACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCATAA 

SEQ ID 8: shdws the deduced amino acid sequence for BACE WT R57K. 

MASMTGGQQMGRGSMAGVLPAHGTQHGIEILPLRSGLGGAPLGLEILPRETDEEPEEPGRKGSFVEMVDNLRGKSG 
QGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLV 
SIPHGPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGF 
PLNQSEVIiASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCBCEYNYDKSIVDSGTTN 
LRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLR 
PVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYVVFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMED 
CGYNIPQTDES 

SEQ ID 9: shows the DNA sequence coding for the BACE WT R57DEL. 

ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCCATGGCGGGAGTGCTGCCT 

GCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 

CTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCAGGGGCAGC 

TTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATG 

ACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTT 

GCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTAqCAGAGGCAGCTGTCCAGC 

ACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGG 

GAGCTGGGCACCGACCTGG.TAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAAC 

ATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATC 

CTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC 

TCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGC 

TTCCCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGT 

ATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTAT 

TATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAG 

GAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAG 

AAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT- 

GATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAAC 

ATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATC 

ACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGAC 

TGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 

GAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCT 

TGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGfCACCTTGGAC 

ATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCATAA 
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. SEQ ID 10: shows the deduced amino acid sequence for BACE WT R57dci. 

MASMTGGQQMGRGSMAGVLPAHGTQHGIEUiPLRSGLGGAPLGLRLPRETDEEPEEPGRGSFVElWDNLRGKSGQ 
GYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQLSSTYRDLRKGVyVPYTQGECWEGELGTDLVS 
IPHGPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFP 
LNQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNL 
RLPBCKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTNQSFRITILPQQYLRP 
VEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDC 
GYNIPQTDES 

SEQ ID 1 1 : shows the .DNA sequence coding for the BACE N->Q R56KR57K. 

. atggctagcatgactggtggacagcaaatgggtcgcggatccatggcgggagtgctgcct 
gcccacggcacccagcacggcatccggctgcccctgcgcagcggcctggggggcgccccc 
ctggggctgcggctgccccgggagaccgacgaagagcccgaggagcccggcaagaagggc 
agctttgtggagatggtggacaacCtgaggggcaagtcggggcagggctactacgtggag 
atgaccgtgggcagccccccgcagacgctcaacatcctggtggatacaggcagcagtaac 
tttgcagtgggtgctgccccccaccgcttcctgcatcgctactaccagaggcagctgtcc 
agcacataccgggacctccggaagggtgtgtatgtgccctacacccagggcaagtgggaa 
ggggagctgggcaccgacctggtaagcatcccccatggcccccaggtcactgtgcgtgcc 

AACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCCAGGGCTCCAACTGGGAAGGC 
ATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTT 
GACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCT 
GGCTTCCCCCTCCAGCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGA 
GGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGG 
TATTATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGC 
AAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCC 
AAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTC 
CCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGG 
AACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCCAGCAGTCCTTCCGC 
ATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGAC 
GACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATC 
ATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGC 
GCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTG 
GACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCACATCACCATCATCAC 
CACTAA 

SEQ ID 12: shows the deduced amino acid sequence for BACE N->Q R56KR57K 

MASMTGGQQMGRGSMAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEEPEEPGKKGSFVEMVDNLRGKSG 
QGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGPCWEGELGTDLV 
SIPHGPQVTVRANIAAITESDKFFIQGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGF 
PLQQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTN 
LRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTQQSFRITILPQQYLR 
PVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMED 
CGYNIPQTDESHHHHHH 

SEQ ID 13: shows the DNA sequence coding for the BACE N->Q R56KR57K no His. 

' ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCCATGGCGGGAGTGCTGCCT 
GCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 
CTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCAAG7UVGGGC 
AGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAG 
ATGAC.CGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAAC 
TTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTAGTACCAGAGGCAGCTGTCC 
AGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAA 
GGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCCAGGTCACTGTGCGTGCC 
AACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCCAGGGCTCCAACTGGGAAGGC 
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ATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTT 
GACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCT 
GGCTTCCCGCTCCAGCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGA 
GGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGG 
TATTATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGC 
AAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCC 
AAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTC 
CCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGG 
AACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCCAGCAGTCCTTCCGC 
ATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGAC 
GACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATC 
ATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGC 
GCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTG 
' GACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCATAG 

SEQ ID 14: shows the deduced amino acid sequence for BACE N->Q R56KR57K no His 

MASMTGGQQMGRGSMAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEEPEEPGKKGSFVEMVDNLRGKSG 
QGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLV 
SIPHGPQVTVRANIAAITESDKFFIQGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGF 
PLQQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVilVRVEINGQDLKMDCKEYNYDKSIVDSGTTN 
LRLPKECVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTQQSFRITILPQQYLR 
PVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYVVFDRARKRLGFAVSACHVHDEFRTAAVEGPFVTLDMED 
CGYNIPQTDES 

SEQ ID 15: shows the DNA sequence coding for the BACE N->Q R57K. 

ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCCATGGCGGGAGTGCTGCCT 

GCCCACGGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCC 

CTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAAGGGC 

AGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAG. 

ATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAAC 

TTTGCAGTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCC 

AGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAA 

GGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCCAGGTCACTGTGCGTGCC 

AACATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCCAGGGCTCCAACTGGGAAGGC 

ATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTT 

GACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCT 

GGCTTCCCCCTCCAGCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGA 

GGTATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGG 

TATTATGAGGTGATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGC 

AAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCC 

AAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTC 

CCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGG 

AACATTTTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCCAGCAGTCCTTCCGC 

ATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGAC 

GACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATC 

ATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGA/^CGAATTGGCTTTGCTGTCAGC 

GCTTGCCATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTG 

GACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCACATCACCATCATCAC 

CACTAA 

SEQ ID 16: shows the deduced amino acid sequence for BACE N->Q R57K 

MASMTGGQQMGRGSMAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEEPEEPGRKGSFVEMVDNLRGKSG 
QGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLV 
SIPHGPQVTVRANIAAITESDKFFIQGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGF 
PLQQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTN 
LRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTQQSFRITILPQQYLR 
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PVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMED 
CGYNIPQTDESHHHHHH 



SEQ ID 17: shows the DNA sequence coding for the BACE N->Q R57DEL. 

ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCCATGGCGGGAGTGCTGCCTGCCCACGGCACCCAGCACGG 
CATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCG 
AGGAGCCCGGCAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATG 
ACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCA 
CCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACA 
CCCAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCCAGGTCACTGTGCGTGCCAAC 
ATTGCTGCCATCACTGAATCAGACAAGTTCTTCATCCAGGGCTCCAACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGA. 
• GATTGCCA'GGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCC 
TGCAGCTTTGTGGTGCTGGCTTCCCCCTCCAGCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGT 
ATCGACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTATGAGGTGATCATTGTGCG 
GGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAGTACAACTATGACT^GAGCATTGTGGACAGTGGCACCA 
CCAACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCT 
GATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATTTTCCCAGTCATCTCACT 
CTACCTAATGGGTGAGGTTACCCAGCAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATG 
TGGCCACGTCCCAAGACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATG 
GAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAGTT 
CAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATGGAAGACTGTGGCTACAACATTCCACAGACAGATGAGT- 
CACATCACCATCATCACCACTAA 

SEQ ID 18: shows the deduced amino acid sequence for BACE N->Q R57del 

MASMTGGQQMGRGSMAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEEPEEPGRGSFVEMVDNLRGKSGQ 

GYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVS ' 

IPHGPQVTVRANIAAITESDKFFIQGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFP 

LQQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNL 

RLPKKVFEAAVKS I KAASSTEKFPDGFWLGEQLVCWQAGTT PWN I FPVI SLYLMGEVTQQS FRITILPQQYLRP 

VEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDC 

GYNIPQTDESHHHHHH 

SEQ ID 19: shows the amino acid sequence of BACE WT R56KR57K crystallised. 

LPRETDEEPEEPGKKGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQ 
■ LSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARP 
DDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVII 
VRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPW 
NIFPVISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKR 
IGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQTDES 



SEQ ED 20: shows the amino acid sequence of BACE N->Q R56KR57K no His as 
crystallised. 



LPRETDEEPEEPGKKGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQ 
LSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPQVTVRANIAAITESDKFFIQGSNWEGILGLAYAEIARrP 
DDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLQQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVII 
VRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPW 
NIFPVISLYLMGEVTQQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKR 
I G FAVSACHVH DEFRTAAVEG PFVTLDME DCG YNI PQTDES- 

SEQ ID 21: shows the amino acid sequence of BACE N->Q R56KR57K crystallised. 
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LPRETDEEPEEPGKKGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPHPFLHRYYQRQ 
LSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPQVTVRANIAAITESDKFFIQGSNWEGILGLAYAEIARP 
DDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPLQQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVII 
VRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPW 
NIFPVISLYLMGEVTQQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGFYWFDRARKR 
IGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQTDESHHHHHH 
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